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METHODS AND MICROORGANISMS FOR THE PRODUCTION OF 3-(2- 
HYDROXY.3-METHYL.BUTYRYLAMlNO)-PROPIONIC ACID (HMBPA) 

Related Applications 

5 The present invention claims the benefit of prior-filed provisional Patent 

Application Serial No. 60/263,053, filed January 19, 2001 (pending). The present 
invention is also related to U.S. Patent Application Serial No. 09/667,569, filed 
September 21, 2000 (pending), which is a continuation-in-part of U.S. Patent 
Application Serial No. 09/400,494, filed September 21, 1999 (abandoned). U.S. Patent 
10 Application Serial No. 09/667,569 also claims the benefit of prior-filed provisional 
Patent Application Serial No. 60/210,072, filed June 7, 2000, provisional Patent 
Application Serial No. 60/221,836, filed July 28, 2000, and provisional Patent 
Application Serial No, 60/227,860, filed August 24, 2000. The entire content of each of 
the above-referenced applications is incorporated herein by this reference. 

15 

Badqsround of the Invention 

Conventional means of synthesizing chemical compomds is via synthesis 
firom bulk chemicals, a process which is limited by factors such as substrate availability 
and/or cost, difficulty in resolving complex mixtures of products, complexities in 

20 synthesizing large quantities of compounds in purified form, and difficulty in producing 
chiral compounds. Accordingly, researchers have recently looked to bacterial or 
microbial systems that express enzymes usefiil for various biosynthetic processes, for 
example, in the synthesis of pharmaceutical compounds, research reagents, 
nutriceuticals, vitamins, nutritional supplements, antibiotic compounds and the like. In 

25 particular, bioconversion processes have been evaluated as a means of favoring 

production of preferred compounds and recently methods of direct microbial synthesis 
have been the focus of much research in the areas of pharmaceuticals and agriculture. 

Summary of the Invention 

30 The present invention relates to a processes for the direct microbial 

synthesis of [R]-3-(2-hydroxy-3-methyl-but3nrylanndno)-propionic acid or 3-(2-hydroxy- 
3-methyl-butyrylamino)-propionic acid ("HMBPA"), referred to interchangeably herein 
as "p-alanine 2-(i2)-hydroxyisolvalerate", "P-alanine 2-hydroxyisolvalerate", "P-alanyl- 
a-hydroxyisovalarate", N-(2-hydroxy-3-methyl-l-oxobutyl)-p-alanine ("HMOBA") 
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and/or "fantothenate". In particular, it has been discovered that in microorganisms 
engineered to overexpress certain en2ymes conventionally associated with pantothenate 
and/or isoleucine-valine (ilv) biosynthesis, an alternative biosynthetic pathway is present 
that competes for key precursors of pantothenate biosynthesis, namely a-ketoisovalerate 
5 (a-KIV) and p-alanine. a-KIV is converted to a-hydroxyisovalerate (a-HTV) catalyzed 
by various reductase enzymes and a-HIV is subsequently condensed with p-alanine to 
produce HMBPA. 

In one embodiment, the invention features a process for the production of 
3-(2-hydroxy-3-methyl-butyrylamino)-propionic acid (HMBPA) that includes culturing 

10 a microorganism having increased keto reductase activity or increased pantothenate 
synthetase activity in the presence of excess a-ketoisovalerate and excess P-alanine, 
such that HMBPA is produced. In another embodiment, the invemtion features a 
process for the production of 3-(2-hydroxy-3-methyI-butyrylamino)-propionic acid 
(HMBPA) that includes culturing a microorganism having increased keto reductase 

1 5 activity and increased pantothenate synthetase activity in the presence of excess a- 
ketoisovalerate and excess p-alanine, such that HMBPA is produced. In one 
embodiment, the microorganism has a modified panE gene, for example, a modified 
panEl gene and/or a modified panE2 gene (e.g., the panE gene is overexpressed, 
deregulated or present in multiple copies). In another embodiment, the microorganism 

20 has a modified panC gene (e.g., the panC gene is overexpressed, deregulated or present 
in multiple copies). In another embodiment, the microorganism fiirther has increased 
acetohydroxyacid isomeroreductase activity. In another embodiment, the 
microorganism is cultured under conditions of increased acetohydroxyacid 
isomeroreductase activity in the presence of excess a-ketoisovalerate and excess p- 

25 alanine, such that HMBPA is produced. In yet another embodiment, the microorganism 
comprises a modified ilvC gene (e.g., the ilvC gene is overexpressed, deregulated or 
present in multiple copies). In yet another embodiment, the microorganism further has 
reduced ketopantoate hydroxymethyltransferase activity (e.g., has a modified panB gene, 
for example a panB gene that has been deleted. 

30 In another aspect, the invention features a process for the production of 3- 

(2-hydroxy-3-methyl-butyrylamino)-propionic acid (HMBPA) that includes culturing a 
microorganism having reduced ketopantoate hydroxymethyltransferase activity in the 
presence of excess a-ketoisovalerate and excess P-alanine, such that HMBPA is 
produced. In another aspect, the invention features a method for enhancing production 
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of 3-(2-hydroxy-3-methyl-butyrylamino)-propionic acid (HMBPA) relative to 
pantothenate that includes culturing a recombinant microorganism imder conditions such 
that the HMBPA production is enhanced relative to pantothenate production. In another 
aspect, the invention featm-es a process for the production of 2-hydroxyisovaleric acid 
5 (a-HTV) that includes culturing a microorganism which overexpresses PanEl or PanE2 
and which further has reduced PanC or PanD activity under conditions such that a-HIV 
is produced. In another aspect, the invention features a process for the production of 3- 
(2-hydroxy-3-methyl-butyrylamino)-propionic acid (HMBPA) that includes culturing a 
recombinant microorganism having decreased expression or activity of serA or glyA 

LO under conditions such that HMBPA is produced. In another aspect, the invention 

features a process for the production of 3-(2-hydroxy-3-methyl-butyrylamino)-propionic 
acid (HMBPA) that includes culturing a recombinant microorganism having decreased 
expression or activity of serA and glyA under conditions such that HMBPA is produced. 
Conditions for culturing the above described microorganisms include, for example, 

15 conditions of increased steady state glucose, conditions of decreased steady state 
dissolved oxygen, and/or cultured under conditions of decreased serine. Products 
produced according to the above described processes and/or methods are also featured. 
Also featured are recombinant microorganisms utilized in the above-described methods. 

Compounds produced according to the methodologies of the present 

20 invention have a variety of uses. For example, HMBPA can be used to synthesize 
. inhibitors of HMG CoA Reductase (II) (Gordon et al Bio. Med. Chem. Lett, 1(3):161 
(1991). Inhibitors of HMG CoA Reductase (II) have been studied for use as in the 
treatment of hypercholesterolaemia and coronary atherosclerosis progression. Inhibitors 
of HMG CoA Reductase also have been used to reduce risk of cardiovascular events in 
, 25 patients at risk. Moreover, the HMBPA precursor 2-hydroxyisovalerate (a-HIV) has 
been demonstrated to have nutriceutical properties, for example, in the prevention of 
aging of the skin. In particular, a-hydroxy acids, such as a-HIV (or 2-hydroxyvaIine), 
can be used to synthesize a-hydroxy esters which have been found to induce increased 
skin thickness by increasing biosyntheses of glycosaminoglycans, proteoglycans, 

30 collagen, elastin, and other dermal components. The compounds can be used to treat 
skin disorders such as age spots, skin lines, wrinkles, photoaging and aging. 

Other features and advantages of the invention will be apparent from the 
following detailed description and claims. 
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Brief Description of the Drawings 

Figure 7 is a schematic representation of the pantothenate and isoleucine- 
valine (//v) biosynthetic pathways. Pantothenate biosynthetic enzymes are depicted in 
bold and their corresponding genes indicated in italics. Isoleucine-valine (/7v) 
5 biosynthetic en2ymes are depicted in bold italics and their corresponding genes indicated 
in italics. 

Figure 2 is a schematic representation of the biosynthetic pathway 
leading to [R]-3-(2-hydroxy-3-methyl-butyrylaniino)-propiomc acid ("HMBPA") in B. 
subtilis. 

10 Figure J is a schematic depiction of the structure of [R]-3-(2-hydroxy-3- 

methyl-butyrylamino)-propionic acid ("HMBPA"). 

Figure ¥ is a HPLC chromatogram of a sample of medium from a 14 L 
fermentation of PA824. 

Figure 5 is a mass spectrum depicting the relative monoisotopic mass of 
15 [R]-3-(2-hydroxy-3-methyl-butyrylamino)-propiomc acid. 

, Figure 6 depicts an alignment of the C-terminal amino acids from known 

or suspected PanB proteins. 

Figure 7 is a schematic representation of the construction of the plasmid 

pAN624. 

20 Figure 5 is a schematic representation of the construction of the plasmid 

pAN620. 

Figure P is a schematic representation of the construction of the plasmid 

pAN636. 

Figure iO is a schematic representation of the construction of the plasmid 
25 pAN637 which allows selection for single or multiple copies using chloramphenicol. 

Figure 77 is a schematic representation of the construction of the plasmid 
pAN238, a plasmid for overexpressing B, subtilis panE2 from the P26 promoter. 

Detailed Description of the Invention 

30 The present invention is based, at least in part, on the discovery of a novel 

biosynthetic pathway in bacteria, namely the [R]-3-(2-hydroxy-3-methyl-butyrylaniino)- 
propionic acid ("HMBPA") biosynthetic pathway. In particular, it has been discovered 
that bacteria are capable of generating HMBPA from a-ketoisovalerate (a-KIV), a key 
product of the isoleucine-valine (//v) biosynthetic pathway and precursor of the 
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pantothenate biosynthetic pathway. Production of HMBPA in bacteria involves at least 
the pantothenate biosynthetic enzymes ketopantoate reductase (the panEl gene product) 
and/or acetohydroxyacid isomeroreductase (the ilvC gene product) and resuhs from the 
condensation of 2-hydroxyisovaleric acid (a-HIV), formed by reduction of a-KTV, and 
5 p-alanine, the latter reaction being catalyzed by the pantothenate biosynthetic enzyme 
pantothenate synthetase (the panC gene product). Production of HMBPA is achieved by 
increasing ketopantoate reductase {e.g., PanEl) and/or PanE2 and/or acetohydroxyacid 
isomeroreductase activities {e.g., IlvC) in microorganisms, for example, by 
overexpressing or deregulating the genes encoduig said enzymes. Optimal production of 

10 HMBPA is achieved by decreasing or deleting ketopantoate hydroxymethyltransferase 
activity (the panB gene product) in microorganisms, for example, by modifying or 
deleting the panB gene which encodes ketopantoate hydroxymethyltransferase (e.g., 
PanB), optionally in addition to increasing ketopantoate reductase and/or PanE2 and/or 
acetohydroxyacid isomeroreductase activities in said microorganisms. The substrates a- 

15 KIV and P-alanine are required for HMBPA production, the latter provided, for 

example, by p-alanine feeding and/or increased aspartate-a-decarboxylate activity (the 
panD gene product). Increasing substrate concentration (/.e., a-KIV and/or p-alanine) 
further enhances production of HMBPA. a-KIV production can be increased by 
overexpressing ilvBNCD genes and/or alsS. HMBPA production can further be 

20 increased by limiting serine availability or synthesis in appropriately engineered- 
microorganisms. 



In order that the present invention may be more readily understood, 
25 certain terms are first defined herein. 

The term "pantothenate biosynthetic pathway" includes the biosynthetic 
pathway involving pantothenate biosynthetic enzymes (e.g., polypeptides encoded by 
biosynthetic enzyme-encoding genes), compounds (e.g., precursors, substrates, 
intermediates or products), cofactors and the like utilized in the formation or synthesis of 
30 pantothenate. The term "pantothenate biosynthetic pathway" includes the biosynthetic 
pathway leading to the synthesis of pantothenate in microorganisms (e.g., in vivo) as 
well as the biosynthetic pathway leading to the synthesis of pantothenate in vitro. 
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The term "pantothenate biosynthetic enzyme" includes any enzyme 
utilized in the foraiation of a compoimd (e.g., intermediate or product) of the 
pantothenate biosynthetic pathway. For example, synthesis of pantoate from a- 
ketoisovalerate (a-KIV) proceeds via the intermediate, ketopantoate. Formation of 
5 ketopantoate is catalyzed by the pantothenate biosynthetic enzyme ketopantoate 

hydroxymethyltransferase (the panB gene product). Formation of pantoate is catalyzed 
by the pantothenate biosynthetic enzyme ketopantoate reductase (the panE gene 
product). Synthesis of p-alanine from aspartate is catalyzed by the pantothenate 
biosynthetic enzyme aspartate-a-decarboxylase (the panD gene product). Formation of 

10 pantothenate from pantoate and p^alanine (e.g. , condensation) is catalyzed by the 
pantothenate biosynthetic enzyme pantothenate synthetase (the panC gene product). 
Based on the newly discovered HMBPA biosynthesis pathway, pantothenate 
biosynthetic enzymes may also perform an alternative function as enzymes in the 
HMBPA biosynthetic pathway described herein. 

1 5 The term "pantothenate" includes the free acid form of pantothenate, also 

referred to as "pantothenic acid" as well as any salt thereof (e.g., derived by replacing 
the acidic hydrogen of pantothenate or pantothenic acid with a cation, for example, 
calcium, sodium, potassium, ammonium), also referred to as a "pantothenate salt". The 
term "pantothenate" also includes alcohol derivatives of pantothenate. Preferred 

20 pantothenate salts are calcium pantothenate or sodium pantothenate. A preferred alcohol 
derivative is pantothenol. Pantothenate salts and/or alcohols of the present invention 
include salts and/or alcohols prepared via conventional methods from the free acids 
described herein. In another embodiment, calcium pantothenate is synthesized directly 
by a microorganism of Ihe present invention. A pantothenate salt of the present 

25 invention can likewise be converted to a free acid form of pantothenate or pantothenic 
acid by conventional methodology. 

The term "isoleucine-valine biosynthetic pathway" includes the 
biosynthetic pathway involving isoleucine-valine biosynthetic enzymes (e,g., 
polypeptides encoded by biosynthetic enzyme-encoding genes), compounds (e.g., 

30 precursors, substrates, intermediates or products), cofactors and the like utilized in the 
formation or synthesis of conversion of pyruvate to valine or isoleucine. The term 
"isoleucine-valine biosynthetic pathway" includes the biosynthetic pathway leading to 
the synthesis of valine or isoleucine in microorganisms (e.g., in vivo) as well as the 
biosynthetic pathway leading to the synthesis of valine or isoleucine in vitro. Figure 1 



-6- 



wo 02/057476 



PCT/US02/01887 



includes a schematic representation of the isoleucine-valine biosynthetic pathway. 
Isoleucine-valine biosynthetic enzymes are depicted in bold italics and their 
corresponding genes indicated in italics 

The term "isoleucine-valine biosynthetic enzyme" includes any enzyme 
5 utilized in the formation of a compound (e.g., intermediate or product) of the isoleucine- 
valine biosynthetic pathv^ay. According to Figure 1, synthesis of valine from pyruvate 
proceeds via the intemiediates, acetolactate, a,P-dihydroxyisovalerate (a^P-DHTV) and 
a-ketoisovalerate (a-KIV). Formation of acetolactate from pyruvate is catalyzed by the 
. isoleucine-valine biosynthetic enzyme acetohydroxyacid synthetase (the ilvBN gene 

1 0 product, or alternatively, the alsS gene product). Formation of a^P-DHIV from 
acetolactate is catalyzed by the isoleucine-valine biosynthetic enzyme 
acetohydroxyacidisomero reductase (the ilvC gene product). Synthesis of a-KIV from 
a,p-DHIV is catalyzed by the isoleucine-valine biosynthetic enzyme dihydroxyacid 
dehydratase (the ilvD gene product). Moreover, valine and isoleucine can be 

1 5 interconverted vsdth their respective a-keto compounds by branched chain amino acid 
transaminases. Based on the newly discovered HMBPA biosynthesis pathway, 
isoleucine-valine biosynthetic enzymes may also perform an altemative fimction as 
enzymes in the HMBPA biosynthetic pathway described herein. 

The term "3-(2-hydroxy-3-methyl-butyrylamino)-propionic acid 

20 ("HMBPA") biosynthetic pathway" includes the altemative biosynthetic pathway 
involving biosynthetic enzymies and compounds (e.g. , substrates and the like) 
traditionally associated with the pantothenate biosynthetic pathway utilized in the 
formation or synthesis of HMBPA. The term "HMBPA biosynthetic pathway" includes 
the biosjmthetic pathway leading to the synthesis of HMBPA in microorganisms (e.g., in 

25 vivo) as well as the biosynthetic pathway leading to the synthesis of HMBPA in vitro. 
The term "HMBPA biosynthetic enzyme" includes any enzyme utilized 
in the formation of a compound (e.g, intermediate or product) of the HMBPA 
biosynthetic pathway. For example, synthesis of 2-hydroxyisovaleric acid (a-HTV) from 
a-ketoisovalerate (a-KIV) is catalyzed by the panEl or panE2 gene product (PanEl, 

30 alternatively referred to herem ketopantoate reductase or PanE2, a a-ketoacid reductase 
that does not significantly contribute to pantothenate biosynthesis) and/or is catalyzed by 
the ilvC gene product (altematively referred to herein as acetohydroxyacid 
isomeroreductase). Formation of HMBPA from p-alanine and a-HIV is catalyzed by the 
panC gene product (altematively referred to herein as pantothenate synthetase). 
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The term "3-(2-hydroxy-3-methyl-butyiylamino)-propiomc acid 
("HMBPA")" includes the free acid form of HMBPA, also referred to as "3-(2-hydroxy- 
3-methyl-but3aylamino)-propionate" as well as any salt thereof (e.g., derived by 
replacmg the acidic hydrogen of [R]-3-(2-hydroxy-3-methyl-butyrylamino)-propionic 
5 acid or [R]-3-(2-hydroxy-3-methyl-butyrylamino)-propionate with a cation, for example, 
calcium, sodium, potassium, ammonium), also referred to as a "3"(2-hydroxy-3-methyl- 
butyrylamino)-propionic acid salf or "HMBPA salt". Preferred HMBPA salts are 
calcium HMBPA or sodiimi HMBPA. HMBPA salts of the present invention include 
salts prepared via conventional methods from the free acids described herein. An 
10 HMBPA salt of the present invention can likewise be converted to a free acid form of 
[R]-3-(2-hydroxy-3-methyl-butyrylamino)-propionic acid or [R]-3-(2-hydroxy-3-methyl- 
butyrylamino)-propionate by conventional methodology. 

Various aspects of the invention are described in fiirther detail in the 
15 following subsections. 

L Targeting Genes Encoding Various Pantothenate and/or 

Isoleiicine'Valine(ilv) and/or HMBPA Biosynthetic Enzymes 

In one embodiment, the present invention features targeting or modifying 

20 various biosynthetic en2)rmes of the pantothenate and/or isoleucine-valine(//v) and/or 
HMBPA biosynthetic pathways. In particular, the invention features modifying various 
enzymatic activities associated with said pathways by modifying or altering the genes 
encoding said biosynthetic enzymes. 

The term "gene", as used herein, includes a nucleic acid molecule (e.g., a 

25 DNA molecule or segment thereof) that, in an organism, can be separated from another 
gene or other genes, by intergenic DNA (/.e., intervening or spacer DNA which naturally 
flanks the gene and/or separates genes in the chromosomal DNA of the organism). 
Alternatively, a gene may slightly overlap another gene (e.g:, the 3' end of a first gene 
overlapping the 5' end of a second gene), said overlapping genes separated from other 

30 genes by intergenic DNA. A gene may direct synthesis of an enzyme or other protein 
molecule (e.g., may comprise coding seqeunces, for example, a contiguous open reading 
frame (ORF) which encodes a protem) or may itself be functional in the organism. A 
gene in an organism, may be clustered in an operon, as defined herein, said operon being 
separated from other genes and/or operons by the intergenic DNA. An "isolated gene". 
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as used herein, includes a gene which is essentially free of sequences which naturally 
flank the gene in the chromosomal DNA of the organism from which the gene is derived 
(i.e., is free of adjacent coding sequences which encode a second or distinct protein, 
adjacent structural sequences or the like) and optionally includes 5' and 3' regulatory 
5 sequences, for example promoter sequences and/or terminator sequences. In one 
embodiment, an isolated gene includes predominantly coding sequences for a protein 
(e.g., sequences which encode Bacillus proteins). In another embodiment, an isolated 
gene includes coding sequences for a protein (e.g., for a Bacillus protein) and adjacent 5' 
and/or 3' regulatory sequences from the chromosomal DNA of the organism from which 

10 the gene is derived (e.g., adjacent 5' and/or 3' Bacillus regulatory sequences). 

Preferably, an isolated gene contains less than about 10 kb, 5 kb, 2 kb, 1 kb, 0.5 kb, 0.2 
kb, 0.1 kb, 50 bp, 25 bp or 10 bp of nucleotide sequences that naturally flank the gene in 
the chromosomal DNA of the organism from which the gene is derived. 

The term "operon" includes at least two adjacent genes or ORFs, 

15 optionally overlapping in sequence at either the 5' or 3' end of at least one gene or ORF. 
The term "operon" includes a coordinated imit of gene expression that contains a 
promoter and possibly a regulatory element associated with one or more adjacent genes 
or ORFs (e.g., structural genes encoding enzymes, for example, biosynthetic enzymes). 
Expression of the genes (e.g., structural genes) can be coordinately regulated, for 

20 example, by regulatory proteins binding to the regulatory element or by anti-termination 
of transcription. The genes of an operon (e.g. , structural genes) can be transcribed to 
give a single mRNA that encodes all of the proteins. 

A "gene having a mutation" or "mutant gene" as used herein, includes a 
gene having a nucleotide sequence which includes at least one alteration (e.g., 

25 substitution, insertion, deletion) such that the polypeptide or protein encoded by said 
mutant exhibits an activity that differs from the polypeptide or protein encoded by the 
wild-type nucleic acid molecule or gene. In one embodiment, a gene having a mutation 
or mutant gene encodes a polypeptide or protein having an increased activity as 
compared to the polypeptide or protein encoded by the wild-type gene, for example, 

30 when assayed imder similar conditions (e.g. , assayed in microorganisms cultured at the 
same temperature). As used herein, an "increased activity" or "increased enzymatic 
activity" is one that is at least 5% greater than that of the polypeptide or protein encoded 
by the wild-type nucleic acid molecule or gene, preferably at least 5-10% greater, more 
preferably at least 10-25% greater and even more preferably at least 25-50%, 50-75% or 
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75-100% greater than that of the polypeptide or protein encoded by the wild-type nucleic 
acid molecule or gene. Ranges intermediate to the above-recited values, e.g., 75-85%, 
85-90%, 90-95%, are also intended to be encompassed by the present invention. As 
used herein, an "increased activity" or "increased enzymatic activity" can also include 
5 an activity that is at least 1 .25-fold greater than the activity of the polypeptide or protein 
encoded by the wild-type gene, preferably at least 1 .5-fold greater, more preferably at 
least 2-fold greater and even more preferably at least 3-fold, 4-fold, 5-fold, 10-fold, 20- 
fold, 50-fold, 100-fold or greater than the activity of the polypeptide or protein encoded 
by the wild-type gene. 

10 In another embodiment, a gene having a mutation or mutant gene encodes 

a polypeptide or protein having a reduced activity as compared to the polypeptide or 
protein encoded by the wild-type gene, for example, when assayed under similar 
conditions (e.g., assayed in microorganisms cultured at the same temperature). A mutant 
gene also can encode no polypeptide or have a reduced level of production of the wild- • 

1 5 type polypeptide. As used herein, a "reduced activity" or "reduced enzymatic activity" 
is one that is at least 5% less than that of the polypeptide or protein encoded by the wild- 
type nucleic acid molecule or gene, preferably at least 5-10% less, more preferably at 
least 10-25% less and even more preferably at least 25-50%, 50-75% or 75-100% less 
than that of the polypeptide or protein encoded by the wild-type nucleic acid molecule or 

20 gene. Ranges intermediate to the above-recited values, e.g., 75-85%, 85-90%, 90-95%, 
are also intended to be encompassed by the present invention. As used herein, a 
"reduced activity" or "reduced enzymatic activity" can also include an activity that has 
been deleted or "knocked out" (e.g., approximately 100% less activity than that of the 
polypeptide or protein encoded by the wild-type nucleic acid molecule or gene), 

25 Activity can be determined according to any well accepted assay for 

measuring activity of a particular protein of interest Activity can be measured or 
assayed directly, for example, measuring an activity of a protein isolated or purified 
from a cell or mocroorganism. Altematively, an activity can be measured or assayed 
within a cell or mocroorganism or in an extracellular mediimi. For example, assaying 

30 for a mutant gene (z.e., said mutant encoding a reduced enzymatic activity) can be 

accomplished by expressing the mutated gene in a microorganism, for example, a mutant 
microorganism in which the enzyme is temperature-sensitive, and assaying the mutant 
gene for the ability to complement a temperature sensitive (Ts) mutant for enzymatic 
activity. A mutant gene that encodes an "increased enzymatic activity" can be one that 
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complements the Ts mutant more effectively than, for example, a corresponding wild- 
type gene. A mutant gene that encodes a "reduced en2ymatic activity" is one tihat 
complements the Ts mutant less effectively than, for example, a corresponding wild-type 
gene. 

5 It will be appreciated by the skilled artisan that even a single substitution 

in a nucleic acid or gene sequence (e.g, , a base substitution that encodes an amino acid 
change m the corresponding amino acid sequence) can dramatically affect the activity of 
an encoded polypeptide or protein as compared to the corresponding wild-type 
polypeptide or protein. A mutant gene (e.g., encodmg a mutant polypeptide or protein), 

10 as defined herein, is readily distinguishable firom a nucleic acid or gene encoding a 
protein homologue in that a mutant gene encodes a protein or polypeptide having an 
altered activity, optionally observable as a different or distinct phenotype in a 
microorganism expressing said mutant gene or producing said mutant protein or 
polypeptide (/.e., a mutant microorganism) as compared to a corresponding 

1 5 microorganism expressing the wild-type gene. By contrast, a protein homologue has an 
identical or substantially similar activity, optionally phenotypically indiscemable when 
produced in a microorganism, as compared to a corresponding microorganism 
expressing the wild-type gene. Accordingly it is not, for example, the degree of 
sequence identity between nucleic acid molecules, genes, protein or polypeptides that 

20 serves to distinguish between homologues and mutants, rather it is the activity of the 
encoded protein or polypeptide that distinguishes between homologues and mutants: 
homologues having, for example, low (e.g., 30-50% sequence identity) sequence identity 
yet having substantially equivalent functional activities, and mutants, for example 
sharing 99% sequence identity yet having dramatically different or altered functional 

25 activities. 

It will also be appreciated by the skilled artisan that nucleic acid 
molecules, genes, protein or polypeptides for use in the instant invention can be derived 
fi*om any microorganisms having a HMBPA biosynthetic pathway, an ilv biosynthetic 
pathway or a pantothenate biosynthetic pathway. Such nucleic acid molecules, genes, 
30 protein or polypeptides can be identified by the skilled artisan using known techniques 
such as homology screening, sequence comparison and the like, and can be modified by 
the skilled artisan in such a way that expression or production of these nucleic acid 
molecules, genes, protein or polypeptides occurs in a recombinant microorganism (e.g., 
by using appropriate promotors, ribosomal binding sites, expression or integration 
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vectors, modifying the sequence of the genes such that the transcription is increased 
(taking into account the preferable codon usage), etc., according to techniques described 
herein and those known in the art). 

In one embodiment, the genes of the present invention are derived from a 
5 Gram positive microorganism organism (e.g., a microorganism which retains basic dye, 
for example, crystal violet, due to the presence of a Gram-positive wall surrounding the 
microorganism). The term "derived from" (e.g., "derived from" a Gram positive 
microorganism) refers to a gene which is naturally found in the microorganism (e.g., is 
naturally found in a Gram positive microorganism). In a preferred embodiment, the 

10 genes of the present invention are derived from a microorganism belonging to a genus 
selected from the group consisting of Bacillus, Cornyebacterium (e.g., Cornyebacterium 
glutamicum), Lactobacillus, Lactococci and Streptomyces. In a more preferred 
embodiment, the genes of the present invention are derived from a microorganism is of 
the genus Bacillus. In another preferred embodiment, the genes of the present invention 

15 are derived from a microorganism selected from the group consisting of Bacillus 

subtiliSy Bacillus lentimorbus. Bacillus lentus. Bacillus firmus. Bacillus pantothenticus. 
Bacillus amyloliquefaciens. Bacillus cereus. Bacillus circulans. Bacillus coagulans. 
Bacillus licheniformis. Bacillus megaterium. Bacillus pumilus. Bacillus thuringiensis. 
Bacillus halodurans, and other Group 1 Bacillus species, for example, as characterized 

20 by 1 6S rRNA type. In another preferred embodiment, the gene is derived from Bacillus 
brevis or Bacillus stearothermophilus. In another preferred embodiment, the genes of 
the present invention are derived from a microorganism selected from the group 
consisting of Bacillus licheniformis. Bacillus amyloliquefaciem. Bacillus subtilis, and 
Bacillus pumilus. In a particularly preferred embodiment, the gene is derived from 

25 Bacillus subtilis {e.g., is Bacillus subtilis-denvQd). The term "derived from Bacillus 
subtilis'' or ''Bacillus subtilis-dcnvQd'' includes a gene which is naturally found in the 
microorganism Bacillus subtilis. Included within the scope of the present invention are 
Bacilliis-denved genes (e.g., B. subtilis-derived genes), for example. Bacillus or B. 
. subtilis coaX gems, serA genes, glyA genes, coaA g&nes, pan genes and/or ilv genes. 

30 In another embodiment, the genes of the present invention are derived 

from a Gram negative (excludes basic dye) microorganism. In a preferred embodiment, 
the genes of the present invention are derived from a microorganism belonging to a 
genus selected from the group consisting of Salmonella (e.g.. Salmonella typhimurium), 
Escherichia, Klebsiella, Serratia, and Proteus. In a more preferred embodiment, the 



-12- 



wo 02/057476 



PCT/US02/01887 



genes of the present invention are derived from a microorganism of the genus 
Escherichia. In an even more preferred embodiment, the genes of the present invention 
are derived from Escherichia coli. In another embodiment, the genes of the present 
invention are derived from Saccharomyces (e,g., Saccharomyces cerevisiae). 

5 

// Recombinant Ntwleic Acid Molecules and Vectors 
The present invention further features recombinant nucleic acid 
molecules (e.g., recombinant DNA molecules) that include genes described herein (e.g., 
isolated genes), preferably Bacillus genes, more preferably Bacillus subtilis genes, even 
1 0 more preferably Bacillus subtilis pantothenate biosynthetic genes and/or isoleucine- 
valine (//v) biosynthetic genes and/or HMBPA biosynthetic genes. The term 
"recombinant nucleic acid molecule" includes a nucleic acid molecule (e.g., a DNA 
molecule) that has been altered, modified or engineered such that it differs in nucleotide 
sequence from the native or natural nucleic acid molecule from which the recombinant 
15 nucleic acid molecule w^as derived (e.g., by addition, deletion or substitution of one or 
more nucleotides). Preferably, a recombinant nucleic acid molecule (e.g., a recombinant 
DNA molecule) includes an isolated gene of the present invention operably linked to 
regulatory sequences. The phrase "operably luiked to regulatory sequence(s)" means 
that the nucleotide sequence of the gene of interest is linked to the regulatory 
20 sequence(s) in a manner which allows for expression (e.g. , enhanced, increased, 

constitutive, basal, attenuated, decreased or repressed expression) of the gene, preferably 
expression of a gene product encoded by the gene (e.g., when the recombinant nucleic 
acid molecule is included in a recombinant vector, as defined herein, and is introduced 
into a microorganism). 

The term "regulatory sequence" includes nucleic acid sequences which 
affect (e.g., modulate or regulate) expression of other nucleic acid sequences (/.e., 
genes). In one embodiment, a regulatory sequence is included in a recombinant nucleic 
acid molecule in a similar or identical position and/or orientation relative to a particular 
gene of interest as is observed for the regulatory sequence and gene of interest as it 
appears in nature, e.g., in a native position and/or orientation. For example, a gene of 
interest can be included in a recombinant nucleic acid molecule operably linked to a 
regulatory sequence which accompanies or is adjacent to the gene of interest in the 
natural organism (e.g., operably linked to "native'' regulatory sequences (e.g., to the 
"native" promoter). Alternatively, a gene of interest can be included in a recombinant 
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nucleic acid molecule operably linked to a regulatory sequence which accompanies or is 
adjacent to another (e.g., a different) gene in the natural organism. Alternatively, a gene 
of interest can be included in a recombinant nucleic acid molecule operably linked to a 
regulatory sequence from anotlier organism. For example, regulatory sequences from 
5 other microbes {e.g. , other bacterial regulatory sequences, bacteriophage regulatory 
sequences and the like) can be operably linked to a particular gene of interest. 

In one embodiment, a regulatory sequence is a non-native or non- 
naturally-occurring sequence (e.g., a sequence which has been modified, mutated, 
substituted, derivatized, deleted including sequences which are chemically synthesized). 

10 Preferred regulatory sequences include promoters, enhancers, termination signals, anti- 
termination signals and other expression control elements {e.g., sequences to which 
repressors or inducers bind and/or binding sites for transcriptional and/or translational 
regulatory proteins, for example, in the transcribed mRNA). Such regulatory sequences 
are described, for example, in Sambrook, J., Fritsh, E. P., and Maniatis, T. Molecular 

15 • Cloning: A Laboratory Manual 2nd, ed, Cold Spring Harbor Laboratory, Cold Spring 
Harbor Laboratory Press, Cold Spring Harbor, NY, 1989. Regulatory sequences include 
those which direct constitutive expression of a nucleotide sequence in a microorganism 
{e.g., constitutive promoters and strong constitutive promoters), those which direct 
inducible expression of a nucleotide sequence in a microorganism {e,g., inducible 

20 promoters, for example, xylose inducible promoters) and those which attenuate or 

repress expression of a nucleotide sequence in a microorganism (e.g., attenuation signals 
or repressor sequences). It is also within the scope of the present invention to regulate 
expression of a gene of interest by removing or deleting regulatory sequences. For 
example, sequences involved in the negative regulation of transcription can be removed 

25 such that expression of a gene of interest is enhanced. 

In one embodiment, a recombinant nucleic acid molecule of the present 
invention includes a nucleic acid sequence or gene that encode at least one bacterial gene 
product (e.g., a pantothenate biosynthetic enzyme, an isoleucine-valine biosynthetic 
enzyme and/or a HMBPA biosynthetic enzyme) operably linked to a promoter or 

30 promoter sequence. Preferred promoters of the present inveiition include Bacillus 
promoters and/or bacteriophage promoters (e.g., bacteriophage which infect Bacillus). 
In one embodiment, a promoter is a Bacillus promoter, preferably a strong Bacillus 
promoter (e.g. , a promoter associated with a biochemical housekeeping gene in Bacillus 
or a promoter associated with a glycolytic pathway gene in Bacillus). In another 
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embodiment, a promoter is a bacteriophage promoter. In a preferred embodiment, the 
promoter is from the bacteriophage SPOl . In a particularly preferred embodiment, a 
promoter is selected from the group consisting of P26 oxPyeg^ having for example, 
the following respective seqeunces: 
5 GCTATTGACGACAGCTATGGTTCACTGTCCACCAACCAAAACTGTGCTCAGT 
ACCGCCAATATTTCTCCCTTGAGGGGTACAAAGAGGTGTCCCTAGAAGAGAT 
CCACGCTGTGTAAAAATTTTACAAAAAGGTATTGACTTTCCCTACAGG 
GTAATAATTTAATTACAGGCGGGGGCAACCCCGCCTGT(SEQ ID N0:1), 
GCCTACCTAGCTTCCAAGAAAGATATCCTAACAGCACAAGAGCGGAAAGAT 
GTTTTGTTCTACATCCAGAACAACCTCTGCTAAAATTCCTGAAAAATT^ 
AAAAGTTGTTGACTTTATCTACAAGGTGTGGTATAATAATCTTAACAACAGC 
AGGACGC (SEQ ID N0:2), and 

GAGGAATCATAGAATTTTGTCAAAATAATTTTATTGACA^ 
GTTGATATAATTTAAATTTTATTTGACAAAAATGGGCTCGTGTTGTAC^^ ' 
AATGTAGTGAGGTGGATGCAATG (SEQ ID NO:3). Additional preferred promoters 
include tef (the translational elongation factor (TEF) promoter) and pyc (the pyruvate 
carboxylase (PYC) promoter), which promote high level expression in Bacillus (e.g., 
Bacillus subtilis). Additional preferred promoters, for example, for use in Gram positive 
microorganisms include, but are not limited to, amy and SP02 promoters. Additional 
preferred promoters, for example, for use in Gram negative microorganisms include, but 
are not limited to, cos, tac, trp, tet^ trp-tet, Ipp, lac^ Ipp-lac, lacIQ, T7, T5, T3, gal, trc, 
ara, SP6, X-PR or >.-PL. 

In another embodiment, a recombinant nucleic acid molecule of the 
present invention includes a terminator sequence or terminator sequences {e.g. , 
transcription terminator sequences). The term "terminator sequences" includes 
regulatory sequences that serve to terminate transcription of mRNA. Terminator 
sequences (or tandem transcription terminators) can ftirther serve to stabilize mRNA 
(e.g., by adding structure to mRNA), for example, against nucleases. 

In yet another embodiment, a recombinant nucleic acid molecule of the 
present invention includes sequences which allow for detection of the vector containing 
said sequences (i.e., detectable and/or selectable markers), for example, genes that 
encode antibiotic resistance or sequences that overcome auxotrophic mutations, for 
example, trpC^ fluorescent markers, drug markers, and/or colorimetric markers (e.g., 
/acZ/p-galactosidase). In yet another embodiment, a recombinant nucleic acid molecule 
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of the present invention includes an artificial ribosome binding site (RBS) or a sequence 
that becomes transcribed into an artificial RBS. The term "artificial ribosome binding 
site (RBS)" includes a site within an mRNA molecule (e.g., coded within DNA) to 
which a ribosome binds (e.g., to initiate translation) which differs from a native RBS 
5 (e.g. , a RBS found in a naturally-occurring gene) by at least one nucleotide. Preferred 
artificial RBSs include about 5-6, 7-8, 9-10, 1 1-12, 13-14, 15-16, 17-18, 19-20, 21-22, 
23-24, 25-26, 27-28, 29-30 or more nucleotides of which about 1-2, 3-4, 5-6, 7-8, 9-10, 
11-1 2, 13-15 or more differ from the native RBS (e.g., the native RBS of a gene of 
interest, for example, the native panB RBS TAAACATGAGGAGGAGAAAACATG 

10 (SEQ ID N0:4) or the native panD RBS 

ATTCGAGAAATGGAGAGAATATAATATG (SEQ ID N0:5)). 

Preferably, nucleotides that differ are substituted such that they are 
identical to one or more nucleotides of an ideal RBS when optimally aligned for 
comparisons. Ideal RBSs include, but are not limited to, AGAAAGGAGGTGA (SEQ 

15 ID N0:6), TTAAGAAAGGAGGTGANNNNATG (SEQ ID N0:7), 
TTAGAAAGGAGGTGANNNNNATG (SEQ ID N0:8), 
AGAAAGGAGGTGANNNNNNNATG (SEQ ID N0:9), and 

AGAAAGGAGGTGANNNNNNATG (SEQ ID NO: 10). Artificial RBSs can be used to 
. replace the naturally-occurring or native RBSs associated with a particular gene. 

20 Artificial RBSs preferably increase translation of a particular gene. Preferred artificial 
RBSs (e.g., RBSs for increasmg the translation of panB, for example, of B. subtilis 
panB) include CCCTCTAGAAGGAGGAGAAAACATG (SEQ ID NO: 1 1) and 
CCCTCTAGAGGAGGAGAAAACATG (SEQ ID N0:12). Preferred artificial RBSs 
(e.g., RBSs for increasmg the translation of panD, for example, of B. subtilis panD) 

25 includeTTAGAAAGGAGGATTTAAATATG(SEQIDN0:13), 
TTAGAAAGGAGGTTTAATTAATG (SEQ ID NO: 14), 
TTAGAAAGGAGGTGATTTAAATG (SEQ ID NO: 15), 

TTAGAAAGGAGGTGTTTAAAATG (SEQ ID N0:16), ATTCGAGAAAGGAGG 
TGAATATAATATG (SEQ ID N0:17), ATTCGAGAAAGGAGGTGAATAATAATG 
. 30 (SEQ ID N0:18), and ATTCGTAGAAAGGAGGTGAATTAATATG (SEQ ID NO:19). 

The present invention fiirther features vectors (e.g., recombinant vectors) 
that include nucleic acid molecules (e.g., genes or recombinant nucleic acid molecules 
comprising said genes) as described herein. The term "recombinant vector" includes a 
vector (e.g., plasmid, phage, phasmid, virus, cosmid or other purified nucleic acid 
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vector) that has been altered, modified or engineered such that it contains greater, fewer 
or different nucleic acid sequences than those included in the native or natural nucleic 
acid molecule from which the recombinant vector was derived. Preferably, the 
recombinant vector includes a biosyny thetic enzyme-encoding gene or recombinant 
5 nucleic acid molecule including said gene, operably linked to regulatory sequences, for 
example, promoter sequences, terminator sequences and/or artificial ribosome binding 
sites (RBSs), as defined herein. In another embodiment, a recombinant vector of the 
present invention includes sequences that enhance replication in bacteria (e.g., 
replication-enhancing sequences). In one embodiment, replication-enhancing sequences 

10 are derived fi'om E. coli. In another embodiment, replication-enhancing sequences are 
derived from pBR322. 

In yet another embodiment, a recombinant vector of the present invention 
includes antibiotic resistance sequences. The term "antibiotic resistance sequences" 
includes sequences which promote or confer resistance to antibiotics on the host 

1 5 organism {e.g. , Bacillus). In one embodiment, the antibiotic resistance sequences are 
selected firom the group consisting of cat (chloramphenicol resistance) sequences, tet 
(tetracycline resistance) sequences, erm (erythromycin resistance) sequences, neo 
(neomycin resistance) sequences, kan (kanamycin resistance) and spec (spectinomycin 
resistance) sequences. Recombinant vectors of the present invention can further include 

20 homologous recombination sequences (e,g. , sequences designed to allow recombination 
of the gene of interest into the chromosome of the host organism). For example, bpr, 
vpVy and/or amyE sequences can be used as homology targets for recombination into the 
host chromosome. It will fiirther be appreciated by one of skill in the art that the design 
of a vector can be tailored depending on such factors as the choice of microorganism to 

25 be genetically engineered, the level of expression of gene product desired and the like. 



jy. Recombinant Microorganisms 

The present invention further features microorganisms, le., recombinant 
30 microorganisms, that include vectors or genes (e.g., wild-type and/or mutated genes) as 
described herein. As used herein, the term "recombinant microorganism" includes a 
microorganism (e.g., bacteria, yeast cell, fungal cell, etc.) that has been genetically 
altered, modified or engineered (e.g., genetically engineered) such that it exhibits an 
altered, modified or different genotype and/or phenotype (e.g., when the genetic 
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modification affects coding nucleic acid sequences of the microorganism) as compared 
to the naturaliy-occxirring microorganism from which it was derived. 

In one embodiment, a recombinant microorganism of the present 
invention is a Gram positive organism (e.g., a nodcroorganism which retains basic dye, 
5 for example, crystal violet, due to the presence of a Gram-positive wall surrounding the 
microorganism). In a preferred embodiment, the recombinant microorganism is a 
microorganism belonging to a genus selected from the group consisting of Bacillus, 
Cornyebacterium, Lactobacillus, Lactococci and Streptomyces, In a more preferred 
embodiment, the recombinant microorganism is of the genus Bacillus. In another . 

10 preferred embodiment, the recombinant microorganism is selected from the group 
consisting of Bacillus subtilis, Bacillus lentimorbus. Bacillus lentus. Bacillus firmus. 
Bacillus pantothenticuSy Bacillus amyloliquefaciens. Bacillus cereus. Bacillus circulanSy 
Bacillus coagulans. Bacillus licheniformis. Bacillus megaterium. Bacillus pumilus. 
Bacillus thuringiensis. Bacillus halodurans, and other Group 1 Bacillus species, for 

1 5 example, as characterized by 1 6S rRNA type. In another preferred embodiment, the 
recombinant microorganism is Bacillus brevis or Bacillus stearothermophilus. In 
another preferred embodiment, the recombinant microorganism is selected from the 
group consisting of Bacillus licheniformis. Bacillus amyloliguefaciens. Bacillus subtilis, 
and Bacillus pumilus, 

20 In another embodiment, the recombinant microorganism is a Gram 

negative (excludes basic dye) organism. In a preferred embodiment, the recombinant 
microorganism is a microorganism belonging to a genus selected from the group 
consisting of Salmonella, Escherichia, Klebsiella, Serratia, and Proteus. In a more 
preferred embodiment, the recombinant microorganism is of the genus Escherichia. In 

25 an even more preferred embodiment, the recombinant microorganism is Escherichia 
coli. In another embodiment, the recombinant microorganism is Saccharomyces (e.g., 5. 
cerevisiae). 

A preferred "recombinant" microorganism of the present invention is a 
microorganism having a deregulated pantothenate biosynthesis pathway or enzyme, a 
30 deregulated isoleucme-valine (i7v) biosynthetic pathway or enzyme and/or a deregulated 
HMBPA biosynthetic pathway or enzyme. The term "deregulated" or "deregidation" 
includes the alteration or modification of at least one gene in a microorganism that 
encodes an enzyme in a biosynthetic pathway, such that the level or activity of the 
biosynthetic enzyme in the microorganism is altered or modified. Preferably, at least 
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one gene that encodes an enzyme in a biosynthetic pathway is ahered or modified such 
that the gene product is enhanced or increased. The phrase "deregulated pathway" can 
also include a biosynthetic pathway in which more than one gene that encodes an 
enzyme in a biosynthetic pathway is altered or modified such that the level or activity of 
5 more than one biosynthetic enzyme is altered or modified. The ability to "deregulate" a 
pathway (e.g., to simultaneously deregulate more than one gene in a given biosynthetic 
pathway) in a microorganism in some cases arises fi:om the particular phenomenon of 
microorganisms in which more than one enzyme (e.^., two or three biosynthetic 
enzymes) are encoded by genes occurring adjacent to one another on a contiguous piece 

10 of genetic material termed an "operon" (defined herein). Due to the coordinated 
regulation of genes included in an operon, alteration or modification of the single 
promoter and/or regulatory element can result in alteration or modification of the 
expression of each gene product encoded by the operon. Alteration or modification of 
the regulatory element can include, but is not limited to removing the endogenous 

1 5 promoter and/or regulatory element(s), adding strong promoters, inducible promoters or 
multiple promoters or removing regulatory sequences such that expression of the gene 
products is modified, modifying the chromosomal location of the operon, altering 
nucleic acid sequences adjacent to the operon or within the operon such as a ribosome 
binding site, increasing the copy number of the operon, modifying proteins {e.g., 
■ 20 regulatory proteins, suppressors, enhancers, transcriptional activators and the like) 
involved in transcription of the operon and/or translation of the gene products of the 
operon, or any other conventional means of deregulating expression of genes routine in 
the art (including but not limited to use of antisense nucleic acid molecules, for example, 
to block expression of repressor proteins). Deregulation can also involve altering the 

25 coding region of one or more genes to yield, for example, an enzyme that is feedback 
resistant or has a higher or lower specific activity. 

In another preferred embodiment, a recombinant microorganism is 
designed or engineered such that at least one pantothenate biosynthetic enzyme, at least 
one isoleucine-valine biosynthetic enzyme, and/or at least one HMBPA biosynthetic 

30 enzyme is overexpressed. The term "overexpressed" or "overexpression" includes 
expression of a gene product (e,g, a biosynthetic enzyme) at a level greater than that 
expressed prior to manipulation of the microorganism or in a comparable microorganism 
which has not been manipulated. In one embodiment, the microorganism can be 
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genetically designed or engineered to overexpress a level of gene product greater than 
that expressed in a comparable microorganism which has not been engineered. 

Genetic engineering can include, but is not limited to, altering or 
modifying regulatory sequences or sites associated with expression of a particular gene 
5 (e.g., by adding strong promoters, inducible promoters or multiple promoters or by 
removing regulatory sequences such that expression is constitutive), modifying the 
chromosomal location of a particular gene, altering nucleic acid sequences adjacent to a 
particular gene such as a ribosome binding site, increasing the copy number of a 
particular gene, modifying proteins (e.g., regulatory proteins, suppressors, enhancers, 

1 0 transcriptional activators and the like) involved in transcription of a particular gene 
and/or translation of a particular gene product, or any other conventional means of 
deregulating expression of a particular gene routine in the art (including but not limited 
to use of antisense nucleic acid molecules, for example, to block expression of repressor 
proteins). Genetic engineering can also include deletion of a gene, for example, to block 

15 a pathway or to remove a repressor. In embodiments featuring microorganisms having 
deleted genes, the skilled artisan will appreciate that at least low levels of certain 
compounds may be required to be present in or added to the culture medium in order that 
the viability of the microorganism is not compromised. Often, such low levels are 
present in complex culture media as routinely formulated. Moreover, in processes 

20 featuring culturing microorganisms having deleted genes cultured under conditions such 
that commercially or industrially attractive quantities of product are produced, it may be 
necessary to supplement culture media with sUghtly increased levels of certain 
compounds. For example, in processes featuring culturing a microorganism having a 
deleted panB gene, at least low levels of pantothenate must be present in the rnedia, e.g., 

25 levels such as those found in routinely formxilated complex media, whereas slightly 
increased levels of pantothenate may be added to the media in order to produce 
commercially or industrially attractive amounts of, for example, HMBPA. For example, 
10-30 mg/L pantothenate can be added to the media in order to produce commercially or 
industrially attractive amounts of HMBPA. 

30 In another embodiment, the microorganism can be physically or 

environmentally manipulated to overexpress a level of gene product greater than that 
expressed prior to manipulation of the microorganism or in a comparable microorganism 
which has not been manipulated. For example, a microorganism can be treated with or 
cultured in the presence of an agent known or suspected to increase transcription of a 
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particular gene and/or translation of a particular gene product such that transcription 
and/or translation are enhanced or increased. Alternatively, a microorganism can be 
cultured at a temperatxire selected to increase transcription of a particular gene and/or 
translation of a particiilar gene product such that transcription and/or translation are 
5 enhanced or increased. 

V. Culturing and Fermenting Recombinant Microorganisms 
The term "culturing" includes maintaining and/or growing a living 
i microorganism of the present invention (e.g., maintaining and/or growing a culture or 

10 strain). In one embodiment, a microorganism of the invention is cultured in liquid 
media. In another embodiment, a microorganism of the invention is cultured in solid 
media or semi-solid media. In a preferred embodiment, a microorganism of the 
invention is cultured in media (e.g., a sterile, liquid media) comprising nutrients essential 
or beneficial to the maintenance and/or growth of the microorganism (e.g., carbon 

1 5 sources or carbon substrate, for example carbohydrate, hydrocarbons, oils, fats, fatty 
acids, organic acids, and alcohols; nitrogen sources, for example, peptone, yeast extracts, 
meat extracts, malt extracts, urea, ammonium sulfate, ammonium chloride, ammonium 
nitrate and ammonium phosphate; phosphorus sources, for example, phosphoric acid, . 
sodium and potassium salts thereof; trace elements, for example, magnesium, iron, 

20 manganese, calcium, copper, zinc, boron, molybdenvun, and/or cobalt salts; as well as 
growth factors such as amino acids, vitamins, growth promoters and the like). 

Preferably, microorganisms of the present invention are cultured under 
controlled pH. The term "controlled pH" includes any pH which results in production of 
the desired product (e.g., HMBPA). In one embodiment microorganisms are cultured at 

25 a pH of about 7. In another embodiment, microorganisms are cultured at a pH of 
between 6.0 and 8.5. The desired pH may be maintained by any number of methods 
known to those skilled in the art. 

Also preferably, microorganisms of the present invention are cultured 
under controlled aeration. The term "controlled aeration" includes sufficient aeration 

30 (e.g. , oxygen) to result in production of the desired product (e.g. , HMBPA). In one 
embodiment, aeration is controlled by regulating oxygen levels in the culture, for 
example, by regulating the amount of oxygen dissolved in culture media. Preferably, 
aeration of the culture is controlled by agitating the culture. Agitation may be provided 
by a propeller or similar mechanical agitation equipment, by revolving or shaking the 
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cuture vessel {e,g, , tube or flask) or by various pumping equipment. Aeration may be 
further controlled by the passage of sterile air or oxygen through the medium (e.g., 
through the fermentation mixture). Also preferably, microorganisms of the present 
invention are cultured without excess foaming (e.g., via addition of antifoaming agents). 
5 Moreover, microorganisms of the present invention can be cultured vmder 

controlled temperatures. The term "controlled temperature" includes any temperature 
which results in production of the desired product (e,g, HMBPA). In one embodiment, 
controlled temperatures include temperatures between 15°C and 95^C. In another 
embodiment, controlled temperatures include temperatures between 15°C and 70°C. 
10 Preferred temperatures are between 20^C and 55°C, more preferably between 30^C and 
50°C. 

Microorganisms can be cultured (e.g., maintained and/or grown) in liquid 
media and preferably are cultured, either continuously or intermittently, by conventional 
culturing methods such as standing culture, test tube culture, shaking culture (e.g., rotary 

15 shaking culture, shake flask culture, etc.), aeration spinner culture, or fennentation. In a 
preferred embodiment, the microorganisms are cultured in shake flasks. In a more 
preferred embodiment, the microorganisms are cultured in a fennentor (e.g., a 
fermentation process). Fermentation processes of the present invention include, but are 
not liniited to, batch, fed-batch and continuous processes or methods of fermentation. 

20 The phrase "batch process" or "batch fermentation" refers to a closed system in which 
the composition of media, nutrients, supplemental additives and the like is set at the 
beginning of the fermentation and not subject to alteration during the feraientation, 
however, attempts may be made to control such factors as pH and oxygen concentration 
to prevent excess media acidification and/or microorganism death. The phrase "fed- 

25 batch process" or "fed-batch" fermentation refers to a batch fermentation with the 
exception that one or more substrates or supplements are added (e.g., added in 
increments or continuously) as the fermentation progresses. The phrase "continuous 
process" or "continuous fermentation" refers to a system in which a defined fermentation 
media is added continuously to a fennentor and an equal amount of used or 

30 "conditioned" media is simultaneously removed, preferably for recovery of the desired 
product (e.g., HMBPA). A variety of such processes have been developed and are well- 
known in the art. 
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The phrase "culturing under conditions such that a desired compound is 
produced" includes maintaining and/or growing microorganisms under conditions (e.g., 
temperature, pressure, pH, duration; etc.) appropriate or sufficient to obtain production 
of the desired compound or to obtain desired yields of the particular compound being 
5 produced. For example, culturing is continued for a time sufficient to produce the 
desired amount of a compound (e.g., HMBPA). Preferably, culturing is continued for a 
time sufficient to substantially reach suitable production of the compound (e.g., a time 
sufficient to reach a suitable concentration of HMBPA or suitable ratio of 
HMBPA:pantothenate). In one embodiment, culturing is continued for about 12 to 24 

10 hours. In another embodiment, culturing is continued for about 24 to 36 hours, 36 to 48 
hours, 48 to 72 hours, 72 to 96 hours, 96 to 120 hours, 120 to 144 hours, or greater than 
144 hours. In yet another embodiment, microorganisms are cultured under conditions 
such that at least about 5 to 10 g/L of compoimd are produced in about 36 hours, at least 
about 10 to 20 g/L compoimd are produced in about 48 hours, or at least about 20 to 30 

15 g/L compound in about 72 hours. In yet another embodiment, microorganisms are 

cultured under conditions such that at least a ratio of HMBPAiHMBPA+pantothenate of 
1:10 is achieved (/.e., 10% HMBPA versus 90% pantothenate, for example, as 
determined by comparing the peaks when a sample of product is analyzed be HPLC), 
preferably such that at least a ratio of 2: 10 is achieved (20% HMBPA versus 90% 

20 pantotheante), more preferably such that a ratio of at least 2.5 : 1 0 is achieved (25% 

HMBPA versus 75% pantotheante), more preferably at least 3:10 (30% HMBPA versus 
70% pantotheante), 4:10 (40% HMBPA versus 60% pantotheante), 5:10 (50% HMBPA 
versus 50% pantotheante), 6:10 (60% HMBPA versus 40% pantotheante), 7:10 (70% 
HMBPA versus 30% pantotheante), 8:10 (80% HMBPA versus 20% pantotheante), 9:10 

25 (90% HMBPA versus 1 0% pantotheante) or greater. 

The methodology of the present invention can further include a step of 
recovering a desired compound (e.g., HMBPA). The term "recovering" a desired 
compound includes extracting, harvesting, isolating or purifying the compound from 
culture media. Recovering the compound can be performed according to any 

30 conventional isolation or purification methodology known in the art including, but not 
limited to, treatment with a conventional resin (e.g., anion or cation exchange resin, non- 
ionic adsorption resin, etc.), treatment with a conventional adsorbent (e.g., activated 
charcoal, silicic acid, silica gel, cellulose, alumina, etc.)» alteration of pH, solvent 
extraction (e.g., with a conventional solvent such as an alcohol, ethyl acetate, hexane and 
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the like), dialysis, filtration, concentration, crystallization, recrystallization, pH 
adjustment, lyophilization and the like. For example, a compound can be recovered 
from culture media by first removing the microorganisms from the culture. Media are 
then passed tlirough or over a cation exchange resin to remove cations and then through 
5 or over an anion exchange resin to remove inorganic anions and organic acids having 
stronger acidities than the compound of interest The resulting compound can 
subsequently be converted to a salt (e.g., a calcium salt) as described herein. 

Preferably, a desired compound of the present invention is "extracted", 
"isolated" or "purified" such that the resulting preparation is substantially jfree of other 

10 media components (e.g. , free of media components and/or fermentation byproducts). 
The language "substantially free of other media components" includes preparations of 
the desired compound in which the compound is separated fi-om media components or 
fermentation byproducts of the culture from which it is produced. In one embodiment, 
the preparation has greater than about 80% (by dry weight) of the desired compound 

15 (e.g. , less than about 20% of other media components or fermentation byproducts), more 
preferably greater than about 90% of the desired compound (e.g., less than about 10% of 
other media components or fermentation byproducts), still more preferably greater than 
about 95% of the desired compound (e,g, less than about 5% of other media components 
or fermentation byproducts), and most preferably greater than about 98-99% desired 

20 compound (e.g., less than about 1-2% other media components or fermentation 

byproducts). When the desired compound has been derivatized to a salt, the compound - j 
is preferably further firee of chemical contaminants associated with the formation of the 
salt. When the desired compound has been derivatized to an alcohol, the compound is 
preferably further free of chemical contaminants associated with the formation of the 

25 alcohol. 

In an alternative embodiment, the desired compoimd is not purified from 
the microorganism, for example, when the microorganism is biologically non-hazardous 
(eg., safe). For example, the entire culture (or culture supernatant) can be used as a 
source of product (e.g., crude product). In one embodiment, the culture (or culture 
30 supernatant) is used without modification. In another embodiment, the culture (or 

culture supernatant) is concentrated. In yet another embodiment, the culture (or culture 
supernatant) is dried or lyophilized. 
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Preferably, a production method of the present invention results in 
production of the desired compound at a significantly high yield. The phrase 
"significantly high yield" includes a level of production or yield which is sufficiently 
elevated or above what is usual for comparable production methods, for example, which 
5 is elevated to a level sufficient for commercial production of the desired product (e.g., 
production of the product at a commercially feasible cost). In one embodiment, the 
invention features a production method that includes culturing a recombinant 
microorganism under conditions such that the desired product (e.g., HMBPA) is 
produced at a level greater than 2 g/L. In another embodiment, the invention features a 
production method that includes culturing a recombinant microorganism xmder 
conditions such that the desired product (e,g. , HMBPA) is produced at a level greater 
than 10 g/L. In another embodiment, the invention features a production method that 
includes culturing a recombinant microorganism under conditions such that the desired 
product (e.g. , HMBPA) is produced at a level greater than 20 g/L. In yet another 
embodiment, the invention features a production method that includes culturing a 
recombinant microorganism under conditions such that the desired product (e.g., 
HMBPA) is produced at a level greater than 30 g/L. In yet another embodiment, the 
invention features a production metiiod that includes culturing a recombinant 
microorganism under conditions such that the desired product {e.g., HMBPA) is 
produced at a level greater than 40 g/L. The invention further features a production 
method for producing the desired compound that involves culturing a recombinant 
microorganism under conditions such that a sufficiently elevated level of compound is 
produced within a commercially desirable period of time. 

Depending on the biosynthetic enzyme or combination of biosynthetic 
enzymes manipulated, it may be desirable or necessary to provide (e.g., feed) 
microorganisms of the present invention at least one biosynthetic precursor such that the 
desired compound or compounds are produced. The term "biosynthetic precursor" or 
"precursor" includes an agent or compound which, when provided to, brought into 
contact with, or included in the culture medium of a microorganism, serves to enhance 
or increase biosynthesis of the desired product In one embodiment, the biosynthetic 
precursor or precursor is aspartate. In another embodiment, the biosynthetic precursor or 
precursor is P-alanine. The amount of aspartate or (J-alanine added is preferably an 
amount that results in a concentration in the culture medium sufficient to enhance 
productivity of the microorganism (e.g., sl concentration sufficient to enhance production 
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of HMBPA. The term "excess P-alanine" includes p-alanine levels increased or higher 
that those routinely utilized for culturing the microorganism in question. For example, 
culturing the Bacillus microorganisms described in the instant Examples is routinely 
done in the presence of about 0-5 g/L P-alanine. Accordingly, excess P-alanine levels 
5 can include levels of about 5-10 g/L or more preferably about 5-20 g/L P-alanine. 
Biosynthetic precursors of the present invention can be added in the form of a 
concentrated solution or suspension (e.g., in a suitable solvent such as water or buffer) or 
in the form of a solid (e.g., in the form of a powder). Moreover, biosynthetic precursors 
of the present invention can be added as a single aliquot, continuously or intermittently 

10 over a given period of time. 

In yet another embodiment, the biosynthetic precursor is valine. In yet 
another embodiment, the biosynthetic precursor is a-ketoisovalerate. Preferably, valine 
or a-ketoisovalerate is added in an amount that results in a concentration in the medium 
sufficient for production of the desired product (e.g. , HMBPA) to occur. The term 

15 "excess a-KIV" includes a-KIV levels increased or higher that those routinely utilized 
for culturing the microorganism in question. For example, culturing the Bacillus 
microorganisms described in the instant Examples can be done in the presence of about 
0-5 g/L a-KIV. Accordingly, excess a-KIV levels can include levels of about 5-10 g/L, 
and more preferably about 5-20 g/L. The term "excess valine" includes valine levels 

20 increased or higher that those routinely utilized for culturing the microorganism in ; 
question. For example, culturing the Bacillus microorganisms described in the instant 
Examples is routinely done in the presence of about 0-0.5 g/L valine. Accordingly, 
excess valine levels can include levels of about 0.5-5 g/L, preferably about 5-20 g/L 
valine. Biosynthetic precursors are also referred to herein as "supplemental biosynthetic 

25 substrates". 

Moreover, certain aspects of the present invention include culturing 
microorganisms (e.g., recombinant microorganisms) under conditions of increased 
steady state glucose, decreased steady state dissolved oxygen and/or decreased serine. 
The term "increased steady state glucose" includes steady state glucose levels increased 
30 or higher that those routinely utilized for culturing the microorganism in question. For 
example, culturing the Bacillus microorganisms described in the instant Examples is 
routinely done in the presence of about 0.2-1.0 g/L steady state glucose. Accordingly, 
increased steady state glucose levels can include levels of about 1-2 g/1, about 2-5 g/I, 
and preferably about 5-20 g/L steady state glucose. The term "decreased steady state 
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dissolved oxygen" includes steady state dissolved oxygen levels less or lower that those 
routinely utilized for culturing the microorganism in question and, for example, 
inversely correlates with increased steady state glucose levels. For example, culturing 
the Bacillus microorganisms described in the instant Examples is routinely done in the 
5 presence of about 10-30% dissolved oxygen. Accordingly, decreased steady state 
dissolved oxygen can include levels of about 0-10%, and preferably about 0-5% steady 
state dissolved oxygen. The term "reduced serine'* includes serine levels within the 
lower range of those routinely utilized for culturing the microorganism in question. For 
example, culturing the Bacillids microorganisms described in the instant Examples is 

10 routinely done in the presence of about 0-0.5 g/L serine. Accordingly, reduced serine 
levels can include, for example, levels of 0-0.1 g/L serine. 

Another aspect of the present invention includes biotransformation 
processes which feature the recombinant microorganisms described herein. The term 
"biotransformation process", also referred to herein as "bioconversion processes", 

1 5 includes biological processes which results in the production (e.g. , transformation or 
conversion) of appropriate substrates and/or intermediate compounds into a desired 
product (e.g. , HMBPA). 

In one embodiment, the invention features a biotransformation process 
for the production of HMBPA comprising contacting a microorganism which 

20 overexpresses a reductase (e.g., overexpresses PanEl, PanE2 and/or IlvC) with 

appropriate substrates or precursors under conditions such that HMBPA is produced and 
recovering said HMBPA. In another embodiment, the invention features a 
biotransformation process for the production of HMBPA comprising contacting a 
microorganism which has a reduced or deleted PanB activity with appropriate substrates 

25 or precursors under conditions such that HMBPA is produced and recovering said 

HMBPA. In yet another embodiment, the invention features a biotransformation process 
for the production of HMBPA comprising contacting a microorganism which 
overexpresses at least one reductase and has a reduced or deleted PanB activity with 
appropriate substrates or precursors under conditions such that HMBPA is produced and 

30 recovering said HMBPA. Preferred recombinant microorganisms for carrying out the 
above-described biotransfomiations include the recombinant microorganisms described 
herein. In yet another embodiment, the invention features a biotransformation reaction 
that includes contacting ocHTV and p-alanine with isolated or purified PanC under 
conditions such that HMBPA is produced. a-HIV can optionally be obtained by 



-27- 



wo 02/057476 



PCT/US02/01887 



contacting a-KIV with purified or isolated reductase (e.g., PanEl, PanE2 and/or IlvC) 
and a source of reducing equivalent, for example, NADH. Conditions under which a- 
HIV or HMBPA are produced can include any conditions which result in the desired 
production of a-HIV or HMBPA, respectively. In yet another embodiment, the present 
5 invention includes a method of producing a-HIV that includes culturing a 

microorganism that overexpresses PanEl and/or PanE2, and/or IlvC and has a reduced 
or deleted PanC or PanD (to reduce HMBPA or p-alanine sunthesis, respectively) under 
conditions such that a-HIV is produced. 

The microorganism(s) and/or enzymes used in the biotransformation 

1 0 reactions are in a form allowing them to perform their intended function {e,g, , producing 
a desired compound). The microorganisms can be whole cells, or can be only those 
portions of the cells necessary to obtain the desired end result. The microorganisms can 
be suspended (e.g., in an appropriate solution such as buffered solutions or media), 
rinsed (e.g., rinsed free of media ftom culturing the microorganism), acetone-dried, 

1 5 immobilized (e.g. , with polyacrylamide gel or k-carrageenan or on synthetic supports, 
for example, beads, matrices and the like), fixed, cross-linked or permeablized (e.g., 
have permeablized membranes and/or walls such that compounds, for example, 
substrates, intermediates or products can more easily pass through said membrane or 
wall). 

20 

This invention is further illustrated by the following examples which 
should not be construed as limiting. The contents of all references, patents and 
published patent applications cited throughout this application are incorporated herein by 
25 reference. 



-28- 



wo 02/057476 



PCT/US02/01887 



EXAMPLES 

Example I: Discovery and Characterization of the [R]-3-(2"hydroxy-3-methyl- 
butyrylamino)-propionic acid (HMBPA) Biosynthetic Pathway 

5 In developing Bacillus strains for the production of pantothenate, various 

genetic manipulations were made to enzymes involved in the pantothenate biosynthetic 
pathway and the isoleucine- valine (/7v) pathway (Figure 1) as described in U.S. Patent 
Application Serial No. 09/400,494 and U.S. Patent Application Serial No. 09/667,569. 
For example, strains having a deregulated panBCD operon and/or having deregulated 
1 0 panEl exhibited enhanced pantothenate production (when cultured in the presence of (3- 
alanine and a-KIV). Strains further deregulated for ilvBNC and ilvD exhibited enhanced 
pantothenate production in the presence of only p-alanine. Moreover, it was possible to 
. achieve P-alanine independence by further deregulating panD. 

An exemplary strain is PA824, a tryptophan prototroph, Spec and Tet 
15 resistant, deregulated for panBCD at the panBCD locus, deregulated for panEl at the 
panEl locus (two genes in the B, subtilis genome are homologous to E, colipanE, 
panEl and panE2, the former encoding the major ketopantoate reductase involved in 
. . pantothenate production, while panE2 does not contribute to pantothenate synthesis 
(U.S. Patent Application Serial No. 09/400,494), deregulated for ilvD at the ilvD locus, 
20 pverexpressing an ilvBNC cassette at the amyE locus, and overexpressing panD at the 
bpr locus. 

The production of pantothenic acid by PA824 was investigated in 14 L 
fermentor vessels. The composition of the batch and feed media are as follows. 



25 



BATCH 





MATERIAL 


g/L (final) 


1 


Yeast extract 


10 


2 


Na Glutamate 


5 


3 




8 


4 


KH2PO4 


5 


5 


K2HPO4 


7.6 



Addded After Sterilization and Cool Down 
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1 


Glucose 


2.5 


2 


CaCl2 


0.1 


3 


MgCl^ 


1 


4 


Sodium Citrate 


1 


5 


FeS04-7H20 


0.01 


5 


SM-IOOOX 


1ml 



The final volume of the batch medium is 6 L. The trace element solution 
SM-IOOOX has following composition: 0.15 g Na2Mo04-2 H20, 2.5 g H3BO3, 0.7 g 
C0CI2 6 H2O, 0,25 g CuS04-5 H20, 1 .6 g MnCl2-4 H2O, 0.3 g ZnS04-7 H2O are dissolved 
5 in water (final volume IL). 

The batch medixmi was inoculated v^th 60 ml of shake flask PA824 
culture (OD=l 0 in SVY medium: Difco Veal Infusion broth 25 g, Difco Yeast extract 5 
g, Sodium Glutamate 5 g, (NH4)2S04 2.7 g in 740 ml HjO, autoclave; add 200 ml sterile 
1 M K2HPO4 (pH 7) and 60 ml sterile 50% Glucose solution (final volume IL)). The 
1 0 fermentation was run at 43 °C at an air flow rate of 1 2 L/min as a glucose limited fed 
batch. The initial batched glucose (2.5 g/L) was consumed during exponential growth. 
Afterwards glucose concentrations were maintained between 0.2-1 g/L by continuous 
feeding of FEED solution as follows. 



FEED 





MATERIAL 


g/L (final) 


1 


Glucose 


550 


2 


CaClj 


0.1 


3 


SM-IOOOX 


3ml 



15 

The variable feed rate pimip was computer controlled and linked to the 
glucose concentration in the tank by an algorithm. In this example the total feeding was 
6L. 

During fermentation tiie pH was set at 7.2. Control was achieved by pH 
20 measurements linked to computer control. The pH value was maintained by feeding 
either a 25% NHj-solution or a 20% H3P04-solution. NH3 acts simultaneousely as a N- 
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source for the fermentation. The dissolved oxygen concentration [pOj] was set at 30% 
by regulation of tlie agitation and aeration rate. Foaming was conhoUed by addition of 
silicone oil. After the stop of the addition of the feed solution, in this example after 48 
h, the fermentation was continued until the [pOj] value reached 95%. Then the 

5 fermentation was stopped by kilUng the microorganism through sterilization for 30 min. 
The successftil sterilization was proven by plating a sample of the fermentation broth on 
agar plates. The pantothenate titer in the fermentation broth was 21 .7 g/L after 
sterilization and removal of the cells by centrifiigation (determined by HPLC analysis). 

For HPLC analysis the fermentation broth sample was diluted with sterile 

10 water (1 :40). 5 jil of this dilution was injected into a HPLC column (Aqua C 1 8, 5|am, 
150*2.0 mm, Phenomenex™). Temperature of the column was held at 40°C. Mobile 
phase A was 14.8 mM H3PO3, mobile phase B 100% Acetonitrile. Flow rate was 
constant at 0.5 mL/min. A gradient was applied : 

start: 2% mobile phase B 

1 5 0-3 min linear increase to 3% mobile phase B 

3-3.5 min linear increase to 20% mobile phase B 
The detection was carried out by an UV-detector (210 mn). Run time 
was 7 min with an additional 3 min posttime. The retention time for pantothenic acid is 
3.9 minutes. The HPLC chromatogram for the aibove mentioned sample is given in 

20 Figure 4. 

Identification of compound related to peak with retention time 4. 7 minutes 

Under the described fermentation conditions, PA824 routinely yields 
approximately 20-30 g/L pantothenate. In addition to producing significant quantities of 

25 pantothenate, it was discovered a second compound eluted with an approximate 

retention time of 4.7 minutes in this system. The second prominent product formed in 
the fermentation was shown to be [R]-3-(2-hydroxy-3-methyl-butyrylamino)-propionic 
acid (HMBPA) (also referred to herein as "P-alanine 2-(jR)-hydroxyisolvalerate", "p- 
alanine 2-hydroxyisolvalerate", and/or "P-alanyl-a-hydroxyisovalarate). It was 

30 identified by its mass spectrum (Figure 5; relative monoisotopic mass 1 89), 'H- and 
13C-NMR (data not shown) after chromatographic purification by reverse phase flash 
chromatography (mobile phase 10 mM KH2PO4, with increasing contents of acetonitiile 
(1-50%)). 

35 
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In order to verify the identity of the compound, deliberate synthesis of 
racemic P-alanine 2-hydroxyisolvalerate was performed as follows. P-alanine (2.73 g / 
30 mmol) and sodium methoxide (5.67 g of a 30% solution in methanol / 3 1 .5 mmol) 
were dissolved in methanol (40 mL). Methyl 2-hydroxyiso valerate (2-hydroxy-3- 
5 methylbutyric acid methyl ester) (3.96g / 30 mmol) was added and refluxed for 1 8 hours. 
Methanol was then removed by rotavap and replaced by tert-butanol (50 mL). 
Potassium tert-butoxide was added (50 mg) and refluxed for 26 hours. The solvent was 
removed in vacuo, the residue dissolved in water (50 mL) and passed through a strongly 
acidic ion-exchange resin (H+-form Lewatite™ S 100 Gl; 100 mL). More water is used 

10 to rinse the ion exchanger. The aqueous eluates are combined and the water removed in 
vacuo. The residue is subjected to flash chromatography (silica gel; 2% acetic acid in 
ethyl acetate as eluent) and the product fractions evaporated to give a solid residue. The 
residue was recrystallized from ethyl acetate / toluene (10 mL / 20 mL, respectively) and 
analj^ically pure HMBPA (p-alanine 2-hydroxyisolvalerate) was obtained, which 

1 5 showed a relative monoisotopic mass of 1 90 (1 89 + H"^) in the mass spec and the same 
^H-NMR resonances as the product obtained from fermentation. 

The biosynthetic pathway resulting in HMBPA production is set forth in 
Figure 2. The chemical structure of [R]-3-(2-hydroxy-3-methyl-butyrylamino)- 
propionic acid (HMBPA) is depicted in Figure 3. As depicted in Figure 2, HMBPA is 

20 the condensation product of a-hydroxyisovaleric acid (a-HIV) and P-alanine, catalyzed 
by the PanC enzyme. a-HIV is generated by reduction of a-KIV, a reaction which is 
catalyzed by the reductases PanE (e.g., PanEl and/or PanE2) and/or IlvC. 

Based on the chemical structure and biosynthetic pathway leading to 
HMBPA production, the present inventors formulated the following model to describe 

25 the interaction between the previously knovra pantothenate and isoleucine- valine (ilv) 
pathways and the newly characterized HMBPA biosynthetic pathway. In at least one 
aspect, the model states that there exist at least two pathways in microorganisms that 
compete for a-KIV, the substrate for the biosynthetic enzyme PanB, namely the 
pantothenate biosynthetic pathway and the HMBPA biosynthetic pathway. (A third and 

30 fourth pathway competing for a-KIV are those resulting in the production of valine or 
leucine from a-KIV, see e.g.. Figure 1). At least the pantothenate biosynthetic pathway 
and the HMBPA biosynthetic pathway fiirther produce competitive substrates for the 
enzyme PanC, namely a-HIV and pantoate. The model predicts that reducing PanB 
activity will increase a-KIV availability for a-HIV synthesis (and ultimately, HMBPA 
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synthesis) and decrease the amoiint of pantoate and/or pantothenate synthesized by a 
microorganism. Conversely, increasing PanB activity will increase pantoate and 
ketopantoate availability for pantoate/pantothenate synthesis. The following examples 
provide expermental support for the model and further exemplify processes for 
5 increasing the production of HMBPA based on the model 

EXAMPLES n-VI; 

For Examples II-VI, quanitation of pantothenate and/or HMBPA was 
10 performed as follows. Aliquots of fermentation media were diluted 1 : 1 00 and aliquots 
of test tube cultures were diluted 1 : 1 0 in water or 5% acetonitrile prior to injection on a 
Phenomenex Aqua™ 5|a C18 HPLC column (250 x 4.60mm, 125 A). Mobile phases 
were A = 5% acetonitrile, 50 mM monosodium phosphate buffer adjusted to pH 2.5 with 
phosphoric acid; and B = 95% acetonitrile, 5% HjO. 
1 5 Linear gradients were as follows. 



Minutes 


Solvent A 


Solvent B 


0 


100% 


0% 


16 


100% 


0% 


17 


0% 


100% 


20 


0% 


100% 


21 


100% 


0% 



Additional parameters and apparatus were as follows: Flow rate =1.0 
ml/min; Injection volume = 20 pJ; Detector = Hewlett Packard 1090 series DAD UV 
20 detector-3014. Signal A = 197 nm, ref. = 450 nm, Firmware revision E; Colimm heater = 
Oven tempature 40°C; Hardware = Hewlett Packard Kayak™ XA; and Software = 
Hewlett Packard Chemstation Plus™ family revision A.06.03[509]. 

Under these fermentation conditions, PA824 routinely yields 
approximately 30-40 g/L pantothenate. HMBPA elutes at approximately 13 minutes in 
25 this system. 
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Example II: Ketopantoate Reductase Contributes to the Production of HMBPA 
and Increasing Ketopantoate Reductase Activity in Bacillus Results 
in Enhanced HMBPA Production 
5 As described in Example I, a novel HPLC peak corresponding to 

HMBPA was observed in microorganisms overexpressing pojiE] indicating that 
increased ketopantoate reductase contributes to the production of HMBPA (in addition 
to production of pantothenate). As mentioned previously, two genes in the B, subtilis 
genome are homologous to the E, colipanE gene encoding ketopantoate reductase and 

10 have been named panEl and panE2, In Bacillus, the panEl gene encodes the major 
ketopantoate reductase involved in pantothenate production, while panE2 does not 
contribute to pantothenate synthesis. In fact, overexpression of panE2 from a P26 
promoter leads to a reduction in pantothenate titer (see e.g., U.S. Patent Application 
Serial No. 09/400,494). 

1 5 Accordingly, it was tested whether, beside being produced by the panEl 

gene product, it was possible that a significant portion of the a-HIV necessary to make 
HMBPA was being produced by the panE2 gene product. It was hypothesized that the 
panE2 gene product is an enzyme that can reduce a-KIV to a-HIV, but that can not 
significantly reduce ketopantoate to pantoate. 

20 To test the hypothesis, 77a«jE2 was deleted from pantothenate production 

strain PA824 (described in Example I) by transforming with a ApanE2::cat cassette 
from chromosomal DNA of strain PA248 (ApanE2::caf) (set forth as SEQ ID NO:24, 
for construction see e.g., U.S. Patent Application Serial No. 09/400,494) to give strain 
PA919. Three isolates of PA919 were compared to PA824 for pantothenate and 

25 HMBPA production in test tube cultures grown in SVY plus P-alanine. 
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As indicated by the data in Table 1, all three isolates of PA919 produced 
about four-fold lower HMBPA than PA824 demonstrating that the panE2 gene product 
is a potent contributor to HMBPA synthesis. Moreover, significant increases in 
5 HMBPA production can be achieved simply by overexpression of panE2. An exemplary 
plasmid for the overexpression of panE2, named pAN238, is set forth as SEQ ID NO:25 
(Figure 10). 



1 0 Example III. Increasing Production of HMBPA by Reducing PanB Activity in 
Microorganisms. 

Strains derived from PA365 (the RL-1 lineage equivalent of PA377, 
described in U.S. Patent Application Serial No. 09/667,569) which are deleted for the 
F26 panBCD cassette and which contain a P26 panC^D cassette amplified at the vpr 

1 5 locus and either the wild type P26 P<^^^ cassette (PA666) or a P26 ApanB cassette 

(PA664) amplified at the bpr locus were constructed as follows. An alignment of the C- 
terminal amino acids of known or suspected PanB proteins is shown in Figure 6. Three 
regions called 1, 2 and 3, that were identified having conserved or semi-conserved amino 
acid residues, are mdicated by arrows at the top of the figure. The B, subtilis PanB 

20 protein (RBS02239) is underlined. Two of the PanB proteins (RCY14036 and 

CAB56202.1) are missing region 3 while the latter PanB protein is also missing region 2 
and has non-conserved amino acid residues occupying region 1. 

5. subtilis PanB variants were created that were missing regions 1, 2 and 
3. The desured variants were created by designing 3' PGR primers to amplify the B. 

25 subtilis pan B gene such that region 3, regions 2 and 3, or all three regions would be 
missing from the final product. The PGR products were generated and cloned into E, 
coli expression vector pASK-lBA3, creating plasmids pAN446, pAN447, and pAN448, 
respectively. The plasmids were then transformed into E, coli strain SJ2 that contains 
the panB6 mutation to test for complementation. Only pAN446, which is missing region 

30 3, was able to complement. This indicates that region 3 is not essential for B. subtilis 
PanB activity but that region 2 is required for activity or stability. 
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The next step in this analysis was to transfer the panB gene from pAN446 
to a B, subtilis expression vector and then introduce it into a strain appropriate for 
testing activity of the encoded PanB protein in 5. subtilis. To do this, a strain that is 
deleted for the P26 panBCD operon was first created. This was accomplished by first 
5 inserting a cat gene between the BseKS. site located just upstream of the panB RBS and 
the BgH\ site located in panD, creating plasmid pAN624 (Figure 7). The sequence of 
pAN624 is set forth as SEQ ID NO:20. The resulting deletion-substitution mutation 
(ApanBCD::cat624X which removes all of panB and panC, was crossed into PA354 by 
transformation, with selection for resistance to chloramphenicol on plates supplemented 

10 with 1 mM pantothenate. One of the transformants was saved and named PA644. 
Chromosomal DNA isolated from PA644 was analyzed by PGR and was shown to 
contain the deletion-substitution mutation. As expected, PA644 requires pantothenate 
for growth but retains the engineered ilv genes (^2S"^BNC Pae'/vD) as well as the V2epcin 
El gene originally present in PA354. Thus, it has all the enzymes involved in pantoate 

15 synthesis overproduced except PanB. The gene containing the shortest panB deletion 
was inserted into B. subtilis expression vector pOTP61 (described in US patent 
application Serial No. 09/667,569), creating plasmid pAN627. At the same time, a wild- 
type panB control gene was inserted into pOTP61, creating plasmid pAN630. The Notl 
fragments of each plasmid, lacking E. coli vector sequences, were ligated and 

20 transformed into PA644, with selection for resistance to tetracycline. 

One transformant from each transformation was saved and further 
transformed with chromosomal DNA from PA628 with selection for Pan"*^. PA628 
contains a multicopy Vj^anC^D expression plasmid (pAN620) integrated at the vpr 
locus. In order to determine the effects of the panB gene mutation directly on 

25 pantothenate production, plasmid pAN620, set forth as SEQ ID NO:21 and illustrated 
schematically in Figure 8, provides the remaining two enzymes required for 
pantothenate synthesis (PanC and PanD). Four transformants from each transformation 
were isolated, grown in SVY medium containing 10 g/L aspartate for 48 hours, then 
assayed for pantothenate production. Transformants with the 3 'deleted /7^/25 gene were 

30 named PA664 and those containing the wild-type gene were called PA666. The data 
showed that the 3' deleted panB gene in PA664 encodes a PanB protein with greatly 
reduced activity. To test for HMBPA production, test tube cvdtures of PA365, PA666, 
and PA664 were grown in SVY + aspartate medium with and without added a-KIV or 
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pantoate for 48 hours and then assayed for HMBPA and pantothenate as described 
previously. 
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The data presented in Table 2 demonstrate that in the absence of 
supplements, PA664 produced the most HMBPA while PA666 produced the least, 
indicating an inverse correlation between PanB activity and HMBPA production. This 
5 is consistent with the model which predicts that the two pathways compete for a-KIV, 
the substrate for PanB, and produce competitive substrates for PanC; lowering PanB 
activity would be expected to increase a-KIV availability for a-HIV synthesis and 
increase HMBPA production, correspondingly decreasing the amount of pantoate 
synthesized. When a-KIV is added to the medium, alLthree strains produced 

10 significantly more HMBPA. This result evidences that a-KIV is a precursor to 

HMBPA, as described in Figure 2, and that excess a-KIV favors HMBPA production. 
This result also suggests that synthesis of HMBPA is at least partially due to an overflow 
effect of excess a-KTV production. When pantoate was added to the medium, HMBPA 
was reduced by roughly 50 percent in all three strains. Conversely, the strains each 

1 5 produced significantly more pantothenate. This resuk is also consistent with the model 
that the two pathways produce competing substrates for PanC (a-HIV and pantoate). 
Taken together, the above results further indicate that decreasing pantoate synthesis 
should be beneficial in promoting HMBPA production as well as reducing pantothenate 
levels. 

20 

Example IV. Methods for Regulating HMBPArPantothenate Levels 

As demonstrated in Examples I and II, PanEl and/or PanE2 
contribute to enJianced HMBPA production as does reduced PanB activity. This - 
Example demonstrates that overexpressing PanEl increases HMBPA production relative 

25 to pantothenate production whereas overexpressing PanB decreases HMBPA production 
relative to pantothenate production. Furthermore, in strains overexpressing IlvC, 
HMBPA production is enhanced. 

PA668 is a derivative of PA824 that contains extra copies of P26 pcinB 
amplified at the vpr or panB locus. PA668 was constructed using the panB expression 

30 vector (pAN636) which allows for selection of multiple copies using chloramphenicol. 
The sequence of pAN636 is set forth as SEQ ID NO:22 and the vector is depicted 
schematically in Figure 9. The pAN636 iVb?/ restriction fragment, missing the E, coli 
vector sequences, was ligated and then used to transform PA824 with selection on plates 
contailing 5 |ig/ml chloramphenicol. Transformants resistant to 30 i^ig/ml 
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chloramphenicol were isolated and screened for pantothenate production in 48 hour test 
tube cxiltures. The isolates shown produce less HMBPA that PA824 (conversely 
producing about 10 percent more pantothenate than PA824). A second strain, called 
PA669, was constructed which is PA824 with extra copies of P26 pcinEl amplified at 
5 the vpr or panEl locus. Strain PA669 was constructed by transforming PA824 with the 
self-hgated Notl fragment of plasmid pAN637 with selection for resistance to 
chloramphenicol. The sequence of pAN637 is set forth as SEQ ID NO:23 and the vector 
is depicted schematically in Figure 10. Two isolates of PA669 were chosen for further 
study; PA669-5 produces less PanEl than PA669-7 as judged by SDS-PAGE analysis of 
10 total ceil extracts made from the two strains. 

Test tube cultures of strains PA824, PA668-2, PA668-24, and the two 
isolates of PA669 (PA669-5 and PA669-7) were grown in three different media (SVY, 
SVY + aspartate, and SVY + aspartate + pantoate) for 48 hours and then assayed for 
pantothenate, HMBPA, and p-alanine (Table 3). 
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None of the strains produced detectable quantities of HMBPA in SVY 
medium. All strains produced roughly equivalent amounts of pantothenate and low 
amounts of P-alanine indicating that p-alanine is limiting for both pantothenate and 
5 HMBPA synthesis in these cultures and that p-alanine is a precursor for both 
compounds. When grown in SVY + aspartate medium, the two PA669 isolates 
produced more HMBPA than PA824 whereas both PA668 isolates produced less 
HMBPA than PA824. It is noteworthy that the strain that produces the most PanEl 
(PA669-7) produced the most HMBPA (and the least pantothenate). This suggests that 

1 0 high levels of PanEl favor the production of HMBPA at the expense of lower 

pantothenate synthesis. It is also interesting that PA668-24 produced more HMBPA 
than PA668-2, even though SDS-PAGE analysis of extracts from the two strains showed 
that they produce roughly equivalent levels of PanB. The SDS-PAGE analysis also 
showed that PA668-24 makes much more IlvC than PA668-2. Based on these data, it is 

15 proposed that IlvC influences HMBPA synthesis by increasing steady state levels of 
a-KIV and/or by catalyzing a-HIV formation from a-KIV, thereby accoimting for the 
observed shift towards production of HMBPA. 

The final set of data in Table 3 shows that adding pantoate to the growth 
medium decreased HMBPA production by all stiains that had previously produced 

20 detectable levels, e.g., by shifting synthesis towards pantothenate. This further supports 
the model that a-HIV and pantoate are competitive substrates for PanC. 



Example V: Increasing HMBPA Production by Limiting Serine Availability 

25 

It was hypothesized that the ratio of pantothenate to HMBPA production 
could also be controlled by regulating the availability of serine or methylene 
tetrahydrofolate in the microorganism cultures. In particular, it is proposed that 
decreasuig the availability of serine could increase HMBPA production relative to 

30 pantothenate production, whereas increasing the availability of serine would decrease the 
production of HMBPA relative to pantothenate production. This method is based on the 
understanding that the PanB substrate, methylenetetrahydrofolate is derived from serine. 
Thus, regulating serine levels should effectively regulate PanB substrate levels. To test 
this hypothesis, PA824 was grown in test tube cultures of SVY glucose plus 5 g/L P- 

35 alanine and ± 5 g/L serine for 48 hours at 43°C. 
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Table 4: Production of HMBPA and pantothenate by PA824 with and without 
the addition of serine 



serine added 
at5g/L 


OD«H, 


[pan]g/L 


[HMBPA] g/L 




16.3 


4.9 


0.84 




14.0 


4.5 


0.80 


+ 


13.1 


6.4 


0.56 


+ 


12.9 


6.0 


0.62 



5 

As demonstrated in Table 4, addition of serine decreases the level of 
production while conversely increasing pantothenate production. At least one method of 
decreasing methylene tetrahydrofolate levels in order to regulate HMBPA production 
levels is to decrease the activity of serine hydroxymethyl transferase (the glyA gene 
10 product), thereby decreasing methylene tetrahydrofolate biosynthesis in appropriately 
engineered microorganisms. At least one method of decreasing serine levels in order to 
regulate HMBPA production is to decrease tiie activity of 3-phosphoglycerate 
dehydrogenase (the serA gene product). 

15 

Example VI: Increasing HMBPA Production by Modifying Culture Conditions for . 
Recombinant Microorganisms 

In at least one fermentation (Fermentation PI 62), levels of HMBPA 
production reached 35 g/L. Briefly, fermentation of strain PA824 was carried out as 
20 described in Example I but utilizing PFM-155 medium formulated as follows. 



BATCH 





MATERIAL 


g/L (final) 


1 


Amberex 1003 


5 


2 


Cargill 200/20 (soy flour) 


40 


3 


Na Glutamate 


5 


4 


(NH4)2S04 


8 


5 


MgS047H20 


1 
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6 


MAZUDF204C 


1 


7 


H2O 


qs to 4 L 


Added After Sterilization and Cool Down 








1 


KH2PO4 


10 


2 


K2HP04-3H20 


20 


. 3 




qs to 400 ml 








1 


80% Glucose 


20 


2 


CaCl2-2H20 


0.1 








1 


Sodivim Citrate 


1 


2 


FeS04'7H20 


0.01 


3 


SM-IOOOX 


IX 








FEED 




MATERIAL 


g/L (final) 


1 


80% Glucose 


800 


2 


CaCl2-2H20 


0.8 


3 


H2O 


qs to 3500 ml 


Added After Sterilization and Cool Down 








1 


Sodiiun Citrate 


2.0 


2 


FeS04-7H20 


0.02 


3 


SM-IOOOX 


2X 








4 


Glutamate Na 


5.0 


5 


H2O 


qs to 500 ml 



5 

However, as a result of loss of process control during the fermentation, 
the dissolved oxygen became limiting between 16 and 17 hours and glucose began to 
accumulate after 16 hours. 
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These changes in fermentation conditions produced the following 
significant results at or after 16 hours. Namely, synthesis of HMBPA began to increase 
with a corresponding decrease in pantothenate synthesis. In the four hour interval before 
16 hours the culture produced 7 g/1 HMBPA, four hours afterwards, 9.0 g/1. 
5 Pantothenate was the reverse with 10 g/1 and 6.0 g/1 produced between 12-16 hours and 
16-20 hours, respectively. Between 20 and 36 hours the average rate of HMBPA 
synthesis was 1.0 g/1 hr. Overall, fermentation P162 produced 35 g/1 of HMBPA in 36 
hours. 

Thus, it appears that overfeeding of glucose, and/or limitation of 
10 dissolved oxygen (e.g., beginning at about 16 hours) leads to an increase in HMBPA 
production. Accordingly, two methods for increasing HMBPA production (relative to 
pantothenate production) are to increase steady state glucose levels and/or decrease 
steady state dissolved oxygen levels. 

15 

Equivalents Those skilled in the art will recognize, or be able to ascertain using no 
more than routine experimentation, many equivalents to the specific embodiments of the 
invention described herein. Such equivalents are intended to be encompassed by the 
20 following claims. 
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What is claimed: 

1 . A process for the production of 3-(2-hydroxy-3"methyl- 
butyrylaimno)-propionic acid (HMBPA), comprising culturing a microorganism under 

5 conditions such that HMBPA is produced and detecting the HMBPA produced by said 
microorganism. 

2. A process for the production of 3-(2-hydroxy-3-methyl- 
butyrylamino)-propionic acid (HMBPA), comprising culturing a microorganism under 

10 conditions such that HMBPA is produced and isolating the HMBPA produced by said 
microorganism. 

3 . A process for the production of 3 -(2-hydroxy-3 -methy 1- 
butyrylamino)-propionic acid (HMBPA), comprising culturing a microorganism having 

1 5 increased keto reductase activity or increased pantothenate synthetase activity in the 
presence of excess a-ketoisovalerate and excess p-alanine, such that HMBPA is 
produced. 

4. A process for the production of 3-(2-hydroxy-3-methyl- 

20 butyrylamino)-propionic acid (HMBPA), comprising culturing a microorganism having 
increased keto reductase activity and increased pantothenate synthetase activity in the 
presence of excess a-ketoisovalerate and excess p-alanine, such that HMBPA is 
produced. 

5. The process of claim 3 or 4, wherein said microorganism 
25 comprises a modified panE gene. 

6. The process of claim 5, wherein the panE gene is overexpressed, 
deregulated or present in multiple copies. 

30 7, The process of claim 3 or 4, wherein said microorganism 

comprises a modified panEl gene. 

8. The process of claim 3 or 4, wherein said microorganism 
comprises a modified panE2 gene. 
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9. The process of claim 3 or 4, wherein said microorganism 
comprises a modified panEl gene and a modified panE2 gene. 

5 10. The process of claim 3 or 4, wherein said microorganism 

comprises a modified panC gene. 

1 1 . The process of claim 3 or 4, wherein the panC gene is 
overexpressed, deregulated or present in multiple copies. 

10 

12. The process of claim 3 or 4, wherein said microorganism finther 
has increased acetohydroxyacid isomeroreductase activity. 

13. A process for the production of 3-(2-hydroxy-3-methyl- 

1 5 butyrylamino)-propionic acid (HMBPA), comprising culturing a microorganism having 
increased acetohydroxyacid isomeroreductase activity in the presence of excess a- 
ketoisovalerate and excess P-alanine, such that HMBPA is produced. 

14. The process of claim 12 or 13, wherein said microorganism 
20 comprises a modified ilvC gene. 

15. The process of claim 14, wherein the ilvC gene is overexpressed, 
deregulated or present in multiple copies. 

25 16. The process of any one of claims 3, 4 or 1 1 , wherein said 

microorganism fiirther has reduced ketopantoate hydroxymethyltransferase activity. 

17. The process of claim 16, wherein said microorganism comprises a 
modified panB gene. 

30 

18. The process of claim 1 6, wherein said microorganism has been 
deleted for the panB gene. 
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19. A process for the production of 3-(2-hydroxy-3-methyl- 
butyrylamino)-propiomc acid (HMBPA), comprising culturing a microorganism having 
reduced ketopantoate hydroxymethyltransferase activity in the presence of excess a- 
ketoisovalerate and excess p-alanine, such that HMBPA is produced. 

5 

20. A method for enhancing production of 3-(2-hydroxy-3-methyl- 
butyrylamino)-propiomc acid (HMBPA) relative to pantothenate, comprising culturing a 
recombinant microorganism under conditions such that the HMBPA production is 
enhanced relative to pantothenate production. 

21. A process for the production of 2-hydroxyiso valeric acid (a-HIV), 
comprising culturing a microorganism which overexpresses PanEl or PanE2 and which 
further has reduced PanC or PanD activity under conditions such that a-HIV is 
produced. 

22. A process for the production of 3-(2-hydroxy-3-methyl- 
butyrylamino)-propionic acid (HMBPA), comprising culturing a recombinant 
microorganism having decreased expression or activity of serA or glyA under conditions 
such that HMBPA is produced. 

23 . A process for the production of 3-(2-hydroxy-3-methyl- 
butyrylamino)-propionic acid (HMBPA), comprising culturing a recombinant 
microorganism having decreased expression or activity of serA and glyA under 
conditions such that HMBPA is produced. 

24. The process of any one of the proceeding claims wherein the 
microorganism is cultured under conditions of increased steady state glucose. 

25. The process of any one of the proceeding claims wherein the 
microorganism is cultured under conditions of decreased steady state dissolved oxygen. 

26. The process of any one of the proceeding claims wherein the 
microorganism is cultured under conditions of decreased serine. 

27. A product produced according to any one of the above claims. 
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28. A recombinant microorganism that produces 3-(2-hydroxy-3" 
methyl-butyrylamino)-propionic acid (HMBPA), the microorganism having a 
modification in at least one gene encoding ketopantoate reductase that results in 

5 increased reductase activity and having a mutation or deletion in the panB gene that 
results in reduced ketopantoate hydroxymethyltransferase activity. 

29. The recombinant microorganism of claim 28, wherein the gene 
encoding ketopantoate reductase is a panE gene. 

10 

30. The recombinant microorganism of claim 29, wherein the panE 

geiiQ is panEL 

3 1 . The recombinant microorganism of claim 29, wherein the panE 

15 gene is panE2. 

32. The recombinant microorganism of claim 28, wherein the 
microorganism has a modification in panEl and panE2. 

20 33. The recombinant microorganism of claim 28, further having a 

modification in ilvC that results in increased acetohydroxyacid isomeroreductase 
activity, 

34. A recombinant microorganism that produces 3-(2-hydroxy-3" 
25 methyl-butyrylamino)-propionic acid (HMBPA), the microorganism having a 

modification in ihC that results in increased acetohydroxyacid isomeroreductase activity 
and having a mutation or deletion in the panB gene that results in reduced ketopantoate 
hydroxymethyltransferase activity. 

30 35. The recombinant microorganism of any one of claims 28 to 34, 

wherein said microorganism belongs to the genus Bacillus. 

36. The recombinant microorganism of claim 35, which is Bacillus 

subtilis. 
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FigureZ' Proposed pathway for biosynthesis ofHMBPA. 
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[R]-3-(2-hydroxy-3-methyl-butyrylamino)-propionic acid ("HMBPA") 



FIGS 
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Figure (t. An alignment of the C-terminal amino acids from all known or suspected 
PanB proteins. 
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Figure 2 . Construction of pAN620. 
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Figure <7 . Construction of pAN636. 
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Figure 10 . Construction of pAN637. 
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Figure 11-. Structure ofpAN238, a plasmid designed to overexpress the panE2 
ygwg/romffeeP^^ promoter after integrating at bpr or panE2. 
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SEQUENCE LISTING 
<110> OitiniGene BioProducts Inc., e/a/. 

<120> METHODS AND MICROORGANISMS FOR THE PRODUCTION OF 3-(2- 
^ HYDROXY-3-METHYL-BUTYRLAMINO) -PROPIONIC ACID (HMBPA) 

<130> BGI-14 6PC 

<140> 
<141> 

<160> 25 

<170> PatentIn Ver, 2.0 

<210> 1 
<211> 194 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : promoter 
sequence 

<220> 

<221> -35_signal 
<222> (136) . . (141) 

<220> 

<221> -10_signal 
<222> (159) . . (164) 

<400> 1 

gctattgacg acagctatgg ttcactgtcc accaaccaaa actgtgctca gtaccgccaa 60 
tatttctccc ttgaggggta caaagaggtg tccctagaag agatccacgc tgtgtaaaaa 120 
ttttacaaaa aggtattgac tttccctaca gggtgtgtaa taatttaatt acaggcgggg 180 
gcaaccccgc ctgt 194 



<210> 2 
<211> 163 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: promoter 
sequence 

<220> 

<221> -35_signal 
<222> (113) . . (118) 

<220> 

<221> -10_signal 
<222> (136) . . (141) 
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<400> 2 

gcctacctag cttccaagaa agatatccta acagcacaag agcggaaaga tgttttgttc 60 
tacatccaga acaacctctg ctaaaattcc tgaaaaattt tgcaaaaagt tgttgacttt 120 



<210> 3 
<211> 127 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: promoter 
sequence 

<220> 

<221> -35_signal 
<222> (34) . . (39) 

<220> 

<221> -10_signal 
<222> (58) . . (63) 

<220> 

<221> -35_signal 
<222> (75) . . (80) 

<220> 

<221> -10_signal 
<222> (98) . . (103J 

<400> 3 

gaggaatcat agaattttgt caaaataatt ttattgacaa cgtcttatta acgttgatat 60 

aatttaaatt ttatttgaca aaaatgggct cgtgttgtac aataaatgta gtgaggtgga 120 
tgcaatg . 127 



<210> 4 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence iribosome 
binding site 



atctacaagg 



tgtggtataa taatcttaac aacagcagga cgc 



163 



<400> 4 

taaacatgag gaggagaaaa catg 



24 



<210> 5 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence :ribosome 
binding site 

<400> 5 

attcgagaaa tggagagaat ataatatg 

<210> 6 
<211> 13 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence ; ribosome 
binding site 

<400> 6 

agaaaggagg tga 

<210> 7 
<211> 23 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence : ribosome 
binding site 

<220> 

<223> All occurrences of n = any nucleotide 
<400> 7 

ttaagaaagg aggtgannnn atg 

<210> 8 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : ribosome 
binding site 

<220> 

<223> All occurrences of n = any nucleotide 
<400> 8 

ttagaaagga ggtgannnnn atg 



<210> 9 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: ribosome 
binding site 
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<220> 

<223> All occurrences of n = any nucleotide 



<400> 9 

agaaaggagg tgannnnnnn atg 



23 



<210> 10 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: ribosome 
binding site 

<220> 

<223> All occurrences of n = any nucleotide 
<400> 10 

agaaaggagg tgannnnnna tg 22 

<210> 11 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : ribosome 
binding site 



<210> 12 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: ribosome 
binding site 

<400> 12 

ccctctagag gaggagaaaa catg 24 

<210> 13 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : ribosome 
binding site 



<400> 11 

ccctctagaa ggaggagaaa acatg 



25 



<400> 13 

ttagaaagga ggatttaaat atg 



23 



<210> 14 
<211> 23 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial 
binding site 

<400> 14 

ttagaaagga ggtttaatta atg 

<210> 15 
<211> 23 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial 
binding site 

<400> 15 

ttagaaagga ggtgatttaa atg 

<210> 16 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<22d> 

<223> Description of Artificial 
binding site 

<400> 16 

ttagaaagga ggtgtttaaa atg 

<210> 17 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 
binding site 

<400> 17 

attcgagaaa ggaggtgaat ataatatg 

<210> 18 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 
binding site 

<400> 18 

attcgagaaa ggaggtgaat aataatg 

<210> 19 
<211> 28 
<212> DNA 



Sequence : ribosome 



23 



Sequence : ribosome 



23 



Sequence : ribosome 



23 



Sequence : ribosome 



28 



Sequence : ribosome 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : ribosome 
binding site 

<400> 19 

attcgtagaa aggaggtgaa ttaatatg 28 



<210> 20 
<211> 6886 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
<220> 

<223> pAN624 



vector 



<400> 20 

aagaaaccaa 

tctcgctaac 

aagccatgac 

attatttgca 

atcctacctg 

ctaacaggag 

gagctgaaag 

gtgggcgaca 

tggctgttca 

gattgattta 

tacaaccgct 

tacatctata 

cgacaaaagt 

atcttaaaag 

tcgttcccca 

tggcttgccg 

tcctgcacaa 

ggaataatat 

ttaggaagca 

cttccgtcga 

gaggccgttg 

tcccccggcc 

gtggcgagcc 

tgtggcgccg 

ccaaggtaaa 

aaacgttaag 

tttcctgttt 

agcactcgat 

ttagtgacat 

gccagtcatt 

aaccatcaca 

tattaatgaa 

ttaagttaaa 

gtataggtgt 

ataaatcata 

tagatacacc 

cgtcgctatt 

aaataaatgc 



ttgtccatat 
caaaccggta 
aaaaacgcgt 
cggcgtcaca 
acgcttttta 
gaattaacca 
cccgccttat 
aaataagcgc 
ttctcgacca 
tttggattaa 
gcgatttgta 
acaacgaacc 
gtaacacgtt 
aaaacccccg 
gcatttgttc 
cttcttgatc 
ggcgtgcggc 
acgtttcaag 
ttcccacagc 
cgctctccct 
agcaccgccg 
acggggcctg 
cgatcttccc 
gtgatgccgg 
tcccccttcg 
ttttaaaaaa 
aagctaaact 
acgcgatctg 
tagaaaaccg 
aggcctatct 
aacagaatga 
ttttcctgct 
cccagtaaat 
tttgggaaac 
aaactctttg 
atcaaaaatt 
gtaaccagtt 
agggtaaaat 



tgcatcagac 
accccgctta 
aacaaaagtg 
ctttgctatg 
tcgcaactct 
tggatccgag 
gtagctcata 
cgtcaagcaa 
tttgattaga 
tgtattttga 
tgattttatc 
gttgcttatt 
ttgtacggaa 
ataaaggggg 
aattttgttt 
agacatcatt 
tgcaccgttt 
acgtgctcca 
atcaatgaga 
tatgcgactc 
ccgcaaggaa 
ccaccatacc 
catcggtgat 
ccacgatgcg 
ccgtttctgt 
caataaaaaa 
tcccgccact 
gagctgtaat 
actgtaaaaa 
gacaattcct 
tgtacctgta 
gtaataatgg 
gaagtccatg 
aatttccccg 
aagtcattct 
gtataaagtg 
ctaaaagctg 
ttatatcctt 



attgccgtca 
ttaaaagcat 
tctataatca 
ccatagcatt 
ctactgtttc 
ctcgacagta 
cttgacaaat 
ttggaatact 
gattccagta 
aaaccgctca 
gcctttcttc 
cattaaaatg 
atggagcggc 
cttttcttct 
tgatcattca 
ttgtaggaaa 
aagcatatga 
ttattattat 
tcctctagag 
ctgcattagg 
tggtgcatgc 
cacgccgaaa 
gtcggcgata 
tccggcgtag 
taccattata 
gacgagcagc 
gacagagact 
ataaaaacct 
gtacagtcgg 
gaatagagtt 
aagatagcgg 
gtagaaggta 
gaataataga 
aaccattata 
ttacaggagt 
gctctaactt 
tatttgagtt 
cttgttttat 



ctgcgtcttt 
tctgtaacaa 
cggcagaaaa 
tttatccata 
tccatacccg 
tcaagcactt 
ccaaggtcaa 
tcttcagcaa 
agctgctcaa 
gtaatttgtc 
ggcgaattcc 
gacacctcaa 
aaaaccgttt 
acaaaattgt 
gaacagccac 
taataatgac 
cgccgcttcc 
tcacaatttg 
tcgacctgca 
aagcagccca 
aaggagatgg 
caagcgctca 
taggcgccag 
aggatcaatc 
caccttttga 
atacagcacc 
ctttttgaag 
tcttcaacta 
cattatctca 
cataaacaat 
taaatatatt 
attactatta 
aagagaaaaa 
tttctctaca 
ccaaatacca 
atcccaataa 
tatcaccctt 
gtttcggtat 



tactggctct 
agcgggacca 
gtccacattg 
agattagcgg 
tttttttggg 
cacaatctgg 
aatggatatt 
ctgcttcaaa 
taaaagcagg 
cattttcgat 
ctgttgtctc 
ttcttgcata 
tactctcaaa 
acgggctggt 
tttcggctca 
cttatctcct 
ccgtttacca 
tactttttca 
ggcatgcaag 
gtagtaggtt 
cgcccaacag 
tgagcccgaa 
caaccgcacc 
ttcatccatt 
accttaacgt 
cgtctttcac 
gctttcagaa 
acggggcagg 
tattataaaa 
cctgcatgat 
gaattacctt 
ttattgatat 
gcattttcag 
tcagaaaggt 
gagaatgttt 
cctaactctc 
gtcactaaga 
aaaacactaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 
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tatcaatttc tgtggttata ctaaaagtcg 

cttttctctt ccaattgtct aaatcaattt 
tttttatcta aagtgaattt aggaggctta 
ttttttaaaa gtcaatatta ctgtaacata 
ccaattttcg tttgttgaac taatgggtgc 
aatgtggtct tttgtgtttt tttaaaggat 
tatcttgata ataagggtaa ctattgcatg 
tcttctgcaa gccaaaaaac cttccgttac 
cggcgagttt catccctctg tcccagtcct 
ctatctattt taataatagg tgcagttcgc 
aatcaacggg cttgtaccaa cacattagcc 
aatgccttcg ttcgtttcta aaagcagaac 
gtaaaaggtt ccgtttaacg ttctggctct 
ttcccataaa agcttaatcg gcgtaaatcc 
aaagcatagt aaaatatgct ggatgacgcc 
gctgaggctt gtcgcgatgt ccttcaattc 
cggtctccag cttggctgtt ttggcggatg 
atcagaacgc agaagcggtc tgataaaaca 
cccacctgac cccatgccga actcagaagt 
gtctccccat gcgagagtag ggaactgcca 
aagactgggc ctttcgtttt atctgttgtt 
atccgccggg agcggatttg aacgttgcga 
gcccgccata aactgccagg catcaaatta 
ttgcgtttct acaaactctt tttgtttatt 
catgagacaa taaccctgat aaatgcttca 
tcaacatttc cgtgtcgccc ttattccctt 
tcacccagaa acgctggtga aagtaaaaga 
ttacatcgaa ctggatctca acagcggtaa 
ttttccaatg atgagcactt ttaaagttct 
cgccgggcaa gagcaactcg gtcgccgcat 
ctcaccagtc acagaaaagc atcttacgga 
tgccataacc atgagtgata acactgcggc 
gaaggagcta accgcttttt tgcacaacat 
ggaaccggag ctgaatgaag ccataccaaa 
aatggcaaca acgttgcgca aactattaac 
acaattaata gactggatgg aggcggataa 
tccggctggc tggtttattg ctgataaatc 
cattgcagca ctggggccag atggtaagcc 
gagtcaggca actatggatg aacgaaatag 
taagcattgg taactgtcag accaagttta 
tcatttttaa tttaaaagga tctaggtgaa 
cccttaacgt gagttttcgt tccactgagc 
ttcttgagat cctttttttc tgcgcgtaat 
accagcggtg gtttgtttgc cggatcaaga 
cttcagcaga gcgcagatac caaatactgt 
cttcaagaac tctgtagcac cgcctacata 
tgctgccagt ggcgataagt cgtgtcttac 
taaggcgcag cggtcgggct gaacgggggg 
gacctacacc gaactgagat acctacagcg 
agggagaaag gcggacaggt atccggtaag 
ggagcttcca gggggaaacg cctggtatct 
acttgagcgt cgatttttgt gatgctcgtc 
caacgcggcc tttttacggt tcctggcctt 
tgcgttatcc cctgattctg tggataaccg 
tcgccgcagc cgaacgaccg agcgcagcga 
gatgcggtat tttctcctta cgcatctgtg 
cagtacaatc tgctctgatg ccgcatagtt 
gactgggtca tggctgcgcc ccgacacccg 
tgtctgctcc cggcatccgc ttacagacaa 
cagaggtttt caccgtcatc accgaaacgc 



tttgttggtt caaataatga ttaaatatct 2340 

tattaaagtt catttgatat gcctcctaaa 2400 
cttgtctgct ttcttcatta gaatcaatcc 24 60 
aatatatatt ttaaaaatat cccactttat 2520 
tttagttgaa gaataaagac cacattaaaa 2580 
ttgagcgtag cgaaaaatcc ttttctttct 2640 
ataagctgtc aaacatgaga attcccgttt 2700 
aacgagaagg attcttcact ttctaaagtt 27 60 - 
tttttggatc aaggcagact gctgcaatgt 2820 
aggcgatact gcccaatgga agtataccaa 2880 
caattcgata tcggcagaat agattttttt 2940 
gccttcatca tctataccta acgccttacc 3000 
catattagtg ccaataccga gcgcatagct 3060 
gtgcgtcata taatcccggt accgtttctc 3120 ' 
ggcccgatca attttttccc cagcagcttg 3180 
atctggaaaa tcattaggct gctggttaaa 3240 
agagaagatt ttcagcctga tacagattaa 3300 
gaatttgcct ggcggcagta gcgcggtggt 3360 
gaaacgccgt agcgccgatg gtagtgtggg 3420 
ggcatcaaat aaaacgaaag gctcagtcga 3480 
tgtcggtgaa cgctctcctg agtaggacaa 354 0 
agcaacggcc cggagggtgg cgggcaggac 3600 
agcagaaggc catcctgacg gatggccttt 3660 
tttctaaata cattcaaata tgtatccgct 3720 
ataatattga aaaaggaaga gtatgagtat 3780 
ttttgcggca ttttgccttc ctgtttttgc 384 0 
tgctgaagat cagttgggtg cacgagtggg 3900 
gatccttgag agttttcgcc ccgaagaacg 3960 
gctatgtggc gcggtattat cccgtgttga 4 020 
acactattct cagaatgact tggttgagta 4 080 
tggcatgaca gtaagagaat tatgcagtgc 4140 
caacttactt ctgacaacga tcggaggacc 4200 
gggggatcat gtaactcgcc ttgatcgttg 4260 
cgacgagcgt gacaccacga tgcctgtagc 4320 
tggcgaacta cttactctag cttcccggca 4380 
agttgcagga ccacttctgc gctcggccct 444 0 
tggagccggt gagcgtgggt ctcgcggtat 4 500 
ctcccgtatc gtagttatct acacgacggg 4560 
acagatcgct gagataggtg cctcactgat 4620 
ctcatatata ctttagattg atttaaaact 4 680 
gatccttttt gataatctca tgaccaaaat 4740 
gtcagacccc gtagaaaaga tcaaaggatc 4800 
ctgctgcttg caaacaaaaa aaccaccgct 4860 
gctaccaact ctttttccga aggtaactgg 4 920 
ccttctagtg tagccgtagt taggccacca 4980 
cctcgctctg ctaatcctgt taccagtggc 5040 
cgggttggac tcaagacgat agttaccgga 5100 
ttcgtgcaca cagcccagct tggagcgaac 5160 
tgagctatga gaaagcgcca cgcttcccga 5220 
cggcagggtc ggaacaggag agcgcacgag 5280 
ttatagtcct gtcgggtttc gccacctctg 534 0 
sg99999cgg agcctatgga aaaacgccag 5400 
ttgctggcct tttgctcaca tgttctttcc 54 60 
tattaccgcc tttgagtgag ctgataccgc 5520 
gtcagtgagc gaggaagcgg aagagcgcct 5580 
cggtatttca caccgcatat ggtgcactct 5640 
aagccagtat acactccgct atcgctacgt 5700 
ccaacacccg ctgacgcgcc ctgacgggct 5760 
gctgtgaccg tctccgggag ctgcatgtgt 5820 
gcgaggcagc agatcaattc gcgcgcgaag 5880 
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gcgaagcggc 

tgctactccg 
acatcattca 
tttttaaata 
gcgataggca 
cgccagctta 
gacaagcaaa 
ctgatgtact 
atcgcttcca 
tagcgccctt 
tggtgcgctt 
cattcatgcc 
tccggatgac 
cggcaaacaa 
agaatataac 
tcaatcggcg 
tcattttgcg 



atgcataatg 

tcaagccgtc 
ctttttcttc 
cccgcgagaa 
tccgggtggt 
agacgctaat 
catgctgtgc 
gacaagcctc 
tgcgccgcag 
ccccttgccc 
catccgggcg 
agtaggcgcg 
gaccgtagtg 
attctcgtcc 
ctttcattcc 
ttaaacccgc 
cttcagccat 



tgcctgtcaa 

aattgtctga 
acaaccggca 
atagagttga 
gctcaaaagc 
ccctaactgc 
gacgctggcg 
gcgtacccga 
taacaattgc 
ggcgttaatg 
aaagaacccc 
cggacgaaag 
atgaatctct 
ctgatttttc 
cagcggtcgg 
caccagatgg 
acttttcata 



atggacgaag 

ttcgttacca 
cggaactcgc 
tcgtcaaaac 
agcttcgcct 
tggcggaaaa 
atatcaaaat 
ttatccatcg 
tcaagcagat 
atttgcccaa 
gtattggcaa 
taaacccact 
cctggcggga 
accaccccct 
tcgataaaaa 
gcattaaacg 
ctcccgccat 



cagggattct 

attatgacaa 
tcgggctggc 
caacattgcg 
ggctgatacg 
gatgtgacag 
tgctgtctgc 
gtggatggag 
ttatcgccag 
acaggtcgct 
atattgacgg 
ggtgatacca 
acagcaaaat 
gaccgcgaat 
aatcgagata 
agtatcccgg 
tcagag 



gcaaacccta 

cttgacggct 
cccggtgcat 
accgacggtg 
ttggtcctcg 
acgcgacggc 
caggtgatcg 
cgactcgtta 
cagctccgaa 
gaaatgcggc 
ccagttaagc 
ttcgcgagcc 
atcacccggt 
ggtgagattg 
accgttggcc 
cagcagggga 



5940 

6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6886 



<210> 21 
<211> 7140 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: vector 
<220> 

<223> pAN620 



<400> 21 

tcggcggccg 

caaaagtggc 

ttttgtttcc 

tcctaacagc 

attcctgaaa 

ttaacaacag 

atatttcaca 

ttgttccgac 

aagaaaacga 

aagattttga 

gagtcgatat 

cgattcatgt 

acggggtcgc 

tcggtttaaa 

tggatattga 

ctcgcaatgt 

ttcaaacaag 

ctgcaaaaga 

atccggaact 

ctttttcaaa 

attaatatgt 

aacctgaact 

cttcctaatg 

attattcctg 

gtgcaggaag 

gcaagccatg 

gggaacgaac 

tttcagcctg 

tggcggcagt 

tagcgccgat 



cttcgtcgac 
tgtgactgta 
ttttaaccaa 
acaagagcgg 
aattttgcaa 
caggacgctc 
gctgaaagaa 
gatggggttt 
cgccgttatt 
agcatatccg 
tctttttacg 
agaaagacgc 
gatcgtactg 
agatgcgcag 
attggttcct 
atacttaaca 
tgcggaactt 
tatcattgaa 
cgagcctgtg 
agcgcgttta 
atcgtacgat 
atgtgggaag 
aaaaagtaca 
gtaaacgggg 
gagataaggt 
agccgaaagt 
cagcccgtac 
atacagatta 
agcgcggtgg 
ggtagtgtgg 



cgaaacagca 
aaaagaaatc 
ctgccataac 
aaagatgttt 
aaagttgttg 
tagattagaa 
gccataaaac 
ctgcatgagg 
atgagtattt 
cgcgatattg 
ccagatgctc 
acagacgtgt 
acgaagcttt 
caggtagctg 
gtcgatacgg 
gctgaggaaa 
gtccaagccg 
acgactagcg 
aatgaaattg 
atagataata 
gatgagcggc 
cattacaatt 
aattgtgaat 
aagcggcgtc 
cattattatt 
ggctgttctg 
aattttgtaa 
aatcagaacg 
tcccacctga 
ggtctcccca 



gttataaggc 
gaaaaagacc 
tcgaggccta 
tgttctacat 
actttatcta 
aggaggattt 
aataccattc 
ggcatttaac 
ttgtgaatcc 
agcgggatgc 
atgatatgta 
tatgcgggcg 
tcaatctagt 
ttgttgatgg 
tcagagagga 
gaaaagaagc 
gtgaaagaga 
gaaccataga 
ctggaaagat 
tcattattga 
aaacttcaca 
gatgaagatc 
a^taataatg 
atatgcttaa 
tcctacaaaa 
aatgatcaaa 
aggatcctgt 
cagaagcggt 
ccccatgccg 
tgcgagagta 



atgaagctgt 
gttttgtgtg 
cctagcttcc 
ccagaacaac 
caaggtgtgg 
aaatatgaga 
agagggcaag 
cttagcagac 
tgcacaattc 
agctcttgca 
tcccggtgaa 
ctcaagagaa 
caagccgact 
gttaatcagc 
agacggctta 
gcctaagctg 
tcctgaagcg 
ctatgtagag 
gattctcgct 
tattcgtaga 
gggcaactgt 
tcattgatgc 
gagcacgtct 
acggtgcagc 
tgatgtctga 
acaaaattga 
tttggcggat 
ctgataaaac 
aactcagaag 
gggaactgcc 



ccggtttttg 
aaaacggtct 
aagaaagata 
ctctgctaaa 
tataataatc 
cagattactg 
tcaatcggat 
aaagcaagac 
ggccctaatg 
gaaaacgccg 
aagaatgtca 
ggacattttg 
cgtgcctatt 
gacttcttca 
gccaaaagct 
tatcgggccc 
gtgataaaag 
ctttattcct 
gttgcagttg 
aaggaggtga 
tacggaagca 
tgtgggaatg 
tgaaacgtat 
cgcacgcctt 
tcaagaagcg 
acaaatgctg 
gagagaagat 
agaatttgcc 
tgaaacgccg 
aggcatcaaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 
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taaaacgaaa 

acgctctcct 
ccggagggtg 
ccatcctgac 
tctttgttgt 
ttgagcggca 
actgtccagc 
atggcccagt 
gacaggttca 
tccgttgtgc 
gctacgcctt 
gttgcctcgc 
aaatcgtatc 
ccagtgtcaa 
tttgctccga 
gtggggaatt 
gaaagaaaag 
tttaatgtgg 
taaagtggga 
gattgattct 
ttaggaggca 
gatatttaat 
tagtgtttta 
cttagtgaca 
agagttaggt 
aacattctct 
cctttctgat 
tgaaaatgct 
tatcaataat 
aggtaattca 
tcatgcagga 
tttataatat 
ctgccccgtt 
tacgcctatt 
cttttcgggg 
tgtatccgct 
gtatgagtat 
ctgtttttgc 
cacgagtggg 
ccgaagaacg 
cccgtattga 
tggttgagta 
tatgcagtgc 
tcggaggacc 
ttgatcgttg 
tgcctgcagc 
cttcccggca 
gctcggccct 
ctcgcggtat 
acacgacggg 
cctcactgat 
atttaaaact 
tgaccaaaat 
tcaaaggatc 
aaccaccgct 
aggtaactgg 
taggccacca 
taccagtggc 
agttaccgga 
tggagcgaac 



ggctcagtcg 

gagtaggaca 
gcgggcagga 
ggatggcctt 
agcccatcac 
gctgagtcgc 
cgttcggtcc 
caagcgctgt 
tcacatctgc 
cgcttccgcc 
taatcgttcc 
ccctcggatc 
ctttatattg 
taatcgccac 
tataaggcgc 
ctcatgtttg 
gatttttcgc 
tctttattct 
tatttttaaa 
aatgaagaaa 
tatcaaatga 
cattatttga 
taccgaaaca 
agggtgataa 
tattgggata 
ggtatttgga 
gtagagaaat 
ttttctcttt 
aatagtaatt 
atatatttac 
ttgtttatga 
gagataatgc 
agttgaagaa 
tttataggtt 
aaatgtgcgc 
catgagacaa 
tcaacatttc 
tcacccagaa 
ttacatcgaa 
ttttccaatg 
cgccgggcaa 
ctcaccagtc 
tgccataacc 
gaaggagcta 
ggaaccggag 
aatggcaaca 
acaattaata 
tccggctggc 
cattgcagca 
gagtcaggca 
taagcattgg 
tcatttttaa 
cccttaacgt 
ttcttgagat 
accagcggtg 
cttcagcaga 
cttcaagaac 
tgctgccagt 
taaggcgcag 
gacctacacc 



aaagactggg 

aatccgccgg 
cgcccgccat 
tttgcgtttc 
ttttgctgaa 
accgacagaa 
gctgttgccg 
gcttgtcgcc 
cccgtcctgc 
aggccctaac 
gtttgcagcc 
gccggttggt 
tccaaagttt 
cttgatgcct 
actgtcatcc 
acagcttatc 
tacgctcaaa 
tcaactaaag 
atatatattt 
gcagacaagt 
actttaataa 
accaacaaac 
taaaacaaga 
actcaaatac 
agttagagcc 
ctcctgtaaa 
ataatggttc 
ctattattcc 
accttctacc 
cgctatcttt 
actctattca 
cgactgtact 
cgaagcggcc 
aatgtcatga 
ggaaccccta 
taaccctgat 
cgtgtcgccc 
acgctggtga 
ctggatctca 
atgagcactt 
gagcaactcg 
acagaaaagc 
atgagtgata 
accgcttttt 
ctgaatgaag 
acgttgcgca 
gactggatgg 
tggtttattg 
ctggggccag 
actatggatg 
taactgtcag 
tttaaaagga 
gagttttcgt 
cctttttttc 
gtttgtttgc 
gcgcagatac 
tctgtagcac 
ggcgataagt 
cggtcgggct 
gaactgagat 



cctttcgttt 

gagcggattt 
aaactgccag 
tacaaactct 
gagtaggagc 
atcgcttctc 
tttgaggtaa 
cagtccgggt 
actgcacgtt 
acacgataag 
acagttccgg 
gtttcttttg 
ttcttcagat 
tttcctgtgt 
atttgcggag 
atgcaatagt 
tcctttaaaa 
cacccattag 
atgttacagt 
aagcctccta 
aattgattta 
gacttttagt 
aggatataaa 
agcttttaga 
actttataca 
gaatgacttc 
ggggaaattg 
atggacttca 
cattattaca 
acaggtacat 
ggaattgtca 
ttttacagtc 
gcaattcttg 
taataatggt 
tttgtttatt 
aaatgcttca 
ttattccctt 
aagtaaaaga 
acagcggtaa 
ttaaagttct 
gtcgccgcat 
atcttacgga 
acactgcggc 
tgcacaacat 
ccataccaaa 
aactattaac 
aggcggataa 
ctgataaatc 
atggtaagcc 
aacgaaatag 
accaagttta 
tctaggtgaa 
tccactgagc 
tgcgcgtaat 
cggatcaaga 
caaatactgt 
cgcctacata 
cgtgtcttac 
gaacgggggg 
acctacagcg 



tatctgttgt 

gaacgttgcg 
gcatcaaatt 
tggtaccgag 
cgaaagtgac 
ttgatgtgcc 
cagcgacaac 
tgtttaaaga 
ccacgcccgc 
caagaagtgt 
ctacgtgtgt 
gatcgtaatc 
ctgggtgatt 
agcctaaatc 
atacggcgtc 
tacccttatt 
aaacacaaaa 
ttcaacaaac 
aatattgact 
aattcacttt 
gacaattgga 
ataaccacag 
ttttaccctg 
actggttaca 
atttttgatg 
aaagagtttt 
tttcccaaaa 
tttactgggt 
gcaggaaaat 
cattctgttt 
gataggccta 
ggttttctaa 
aagacgaaag 
ttcttagacg 
tttctaaata 
ataatattga 
ttttgcggca 
tgctgaagat 
gatccttgag 
gctatgtggc 
acactattct 
tggcatgaca 
caacttactt 
gggggatcat 
cgacgagcgt 
tggcgaacta 
agttgcagga 
tggagccggt 
ctcccgtatc 
acagatcgct 
ctcatatata 
gatccttttt 
gtcagacccc 
ctgctgcttg 
gctaccaact 
ccttctagtg 
cctcgctctg 
cgggttggac 
ttcgtgcaca 
tgagctatga 



ttgtcggtga 

aagcaacggc 
aagcagaagg 
acgatcgtcc 
ggcgtattca 
cggcgatccg 
gccttctgac 
gtttccgaga 
gatgacgttt 
ggcatcaggc 
gccatggtca 
attgtccaca 
gtattcaacc 
ccatgcatcg 
ttcggagatt 
atcaagataa 
gaccacattt 
gaaaattgga 
tttaaaaaag 
agataaaaat 
agagaaaaga 
aaattgatat 
catttatttt 
atagcgacgg 
gtgtatctaa 
atgatttata 
cacctatacc 
ttaacttaaa 
tcattaataa 
gtgatggtta 
atgactggct 
tgtcactaac 
ggcctcgtga 
tcaggtggca 
cattcaaata 
aaaaggaaga 
ttttgccttc 
cagttgggtg 
agttttcgcc 
gcggtattat 
cagaatgact 
gtaagagaat 
ctgacaacga 
gtaactcgcc 
gacaccacga 
cttactctag 
ccacttctgc 
gagcgtgggt 
gtagttatct 
gagataggtg 
ctttagattg 
gataatctca 
gtagaaaaga 
caaacaaaaa 
ctttttccga 
tagccgtagt 
ctaatcctgt 
tcaagacgat 
cagcccagct 
gaaagcgcca 



1860 

1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
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cgcttcccga agggagaaag gcggacaggt atccggtaag cggcagggtc ggaacaggag 5460 

agcgcacgag ggagcttcca gggggaaacg cctggtatct ttatagtcct gtcgggtttc 5520 

gccacctctg acttgagcgt cgatttttgt gatgctcgtc aggggggcgg agcctatgga 5580 

aaaacgccag caacgcggcc tttttacggt tcctggcctt ttgctggcct tttgctcaca 564 0 

tgttctttcc tgcgttatcc cctgattctg tggataaccg tattaccgcc tttgagtgag 5700 

ctgataccgc tcgccgcagc cgaacgaccg agcgcagcga gtcagtgagc gaggaagcgg 5760 

aagagcgcct gatgcggtat tttctcctta cgcatctgtg cggtatttca caccgcatat 5820 

ggtgcactct cagtacaatc tgctctgatg ccgcatagtt aagccagtat acactccgct 5880 

atcgctacgt gactgggtca tggctgcgcc ccgacacccg ccaacacccg ctgacgcgcc 594 0 

ctgacgggct tgtctgctcc cggcatccgc ttacagacaa gctgtgaccg tctccgggag 6000 

ctgcatgtgt cagaggtttt caccgtcatc accgaaacgc gcgaggcagc tgcggtaaag 6060 

ctcatcagcg tggtcgtgaa gcgattcaca gatgtctgcc tgttcatccg cgtccagctc 6120 

gttgagtttc tccagaagcg ttaatgtctg gcttctgata aagcgggcca tgttaagggc 6180 

ggttttttcc tgtttggtca cttgatgcct ccgtgtaagg gggaatttct gttcatgggg 624 0 

gtaatgatac cgatgaaacg agagaggatg ctcacgatac gggttactga tgatgaacat 6300 

gcccggttac tggaacgttg tgagggtaaa caactggcgg tatggatgcg gcgggaccag 6360 

agaaaaatca ctcagggtca atgccagcgc ttcgttaata cagatgtagg tgttccacag 6420 

ggtagccagc agcatcctgc gatgcagatc cggaacataa tggtgcaggg cgctgacttc 6480 

cgcgtttcca gactttacga aacacggaaa ccgaagacca ttcatgttgt tgctcaggtc 654 0 

gcagacgttt tgcagcagca gtcgcttcac gttcgctcgc gtatcggtga ttcattctgc 6600 

taaccagtaa ggcaaccccg ccagcctagc cgggtcctca acgacaggag cacgatcatg 6660 

cgcacccgtg gccaggaccc aacgctgccc gagatgcgcc gcgtgcggct gctggagatg 6720 

gcggacgcga tggatatgtt ctgccaaggg ttggtttgcg cattcacagt tctccgcaag 6780 

aattgattgg ctccaattct tggagtggtg aatccgttag cgaggtgccg ccggcttcca 6840 

ttcaggtcga ggtggcccgg ctccatgcac cgcgacgcaa cgcggggagg cagacaaggt 6900 

atagggcggc gcctacaatc catgccaacc cgttccatgt gctcgccgag gcggcataaa 6960 

tcgccgtgac gatcagcggt ccagtgatcg aagttaggct ggtaagagcc gcgagcgatc 7020 

cttgaagctg tccctgatgg tcgtcatcta cctgcctgga cagcatggcc tgcaacgcgg 7080 

gcatcccgat gccgccggaa gcgagaagaa tcataatggg gaaggccatc cagcctcgcg 7140 

<210> 22 
<211> 6725 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: vector 
<220> 

<223> pAN636 
<400> 22 

tcggcggccg cttcgtcgac cgaaacagca gttataaggc atgaagctgt ccggtttttg 60 

caaaagtggc tgtgactgta aaaagaaatc gaaaaagacc gttttgtgtg aaaacggtct 120 

ttttgtttcc ttttaaccaa ctgccataac tcgaggccta cctagcttcc aagaaagata 180 

tcctaacagc acaagagcgg aaagatgttt tgttctacat ccagaacaac ctctgctaaa 240 

attcctgaaa aattttgcaa aaagttgttg actttatcta caaggtgtgg tataataatc 300 

ttaacaacag caggacgctc tagaaggagg agaaaacatg aaaacaaaac tggattttct 360 

aaaaatgaag gagtctgaag aaccgattgt catgctgacc gcttatgatt atccggcagc 420 

taaacttgct gaacaagcgg gagttgacat gattttagtc ggtgattcac ttggaatggt 480 

cgtcctcggc cttgattcaa ctgtcggtgt gacagttgcg gacatgatcc atcatacaaa 540 

agccgttaaa aggggtgcgc cgaatacctt tattgtgaca gatatgccgt ttatgtctta 600 

tcacctgtct aaggaagata cgctgaaaaa tgcagcggct atcgttcagg aaagcggagc 660 

tgacgcactg aagcttgagg gcggagaagg cgtgtttgaa tccattcgcg cattgacgct 720 

tggaggcatt ccagtagtca gtcacttagg tttgacaccg cagtcagtcg gcgtactggg 780 

cggctataaa gtacagggca aagacgaaca aagcgccaaa aaattaatag aagacagtat 840 

aaaatgcgaa gaagcaggag ctatgatgct tgtgctggaa tgtgtgccgg cagaactcac 900 

agccaaaatt gccgagacgc taagcatacc ggtcattgga atcggggctg gtgtgaaagc 960 

ggacggacaa gttctcgttt atcatgatat tatcggccac ggtgttgaga gaacacctaa 1020 

atttgtaaag caatatacgc gcattgatga aaccatcgaa acagcaatca gcggatatgt 1080 
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tcaggatgta agacatcgtg ctttccctga 

agtgcttgac ggcttgtacg ggggaaaata 
aagattttca gcctgataca gattaaatca 
ttgcctggcg gcagtagcgc ggtggtccca 
cgccgtagcg ccgatggtag tgtggggtct 
tcaaataaaa cgaaaggctc agtcgaaaga 
ggtgaacgct ctcctgagta ggacaaatcc 
acggcccgga gggtggcggg caggacgccc 
gaaggccatc ctgacggatg gcctttttgc 
cgtcctcttt gttgtagccc atcacttttg 
attcattgag cggcagctga gtcgcaccga 
atccgactgt ccagccgttc ggtccgctgt 
ctgacatggc ccagtcaagc gctgtgcttg 
cgagagacag gttcatcaca tctgccccgt 
cgttttccgt tgtgccgctt ccgccaggcc 
caggcgctac gcctttaatc gttccgtttg 
ggtcagttgc ctcgcccctc ggatcgccgg 
ccacaaaatc gtatccttta tattgtccaa 
caaccccagt gtcaataatc gccaccttga 
catcgtttgc tccgatataa ggcgcactgt 
agattgtggg gaattctcat gtttgacagc 
gataagaaag aaaaggattt ttcgctacgc 
cattttttaa tgtggtcttt attcttcaac 
ttggataaag tgggatattt ttaaaatata 
aaaaggattg attctaatga agaaagcaga 
aaaatttagg aggcatatca aatgaacttt 
aaagagatat ttaatcatta tttgaaccaa 
gatattagtg ttttataccg aaacataaaa 
attttcttag tgacaagggt gataaactca 
gacggagagt taggttattg ggataagtta 
tctaaaacat tctctggtat ttggactcct 
ttataccttt ctgatgtaga gaaatataat 
atacctgaaa atgctttttc tctttctatt 
ttaaatatca ataataatag taattacctt 
aataaaggta attcaatata tttaccgcta 
ggttatcatg caggattgtt tatgaactct 
tggcttttat aatatgagat aatgccgact 
ctaacctgcc ccgttagttg aagaacgaag 
cgtgatacgc ctatttttat aggttaatgt 
tggcactttt cggggaaatg tgcgcggaac 
aaatatgtat ccgctcatga gacaataacc 
gaagagtatg agtattcaac atttccgtgt 
ccttcctgtt tttgctcacc cagaaacgct 
gggtgcacga gtgggttaca tcgaactgga 
tcgccccgaa gaacgttttc caatgatgag 
attatcccgt attgacgccg ggcaagagca 
tgacttggtt gagtactcac cagtcacaga 
agaattatgc agtgctgcca taaccatgag 
aacgatcgga ggaccgaagg agctaaccgc 
tcgccttgat cgttgggaac cggagctgaa 
cacgatgcct gcagcaatgg caacaacgtt 
tctagcttcc cggcaacaat taatagactg 
tctgcgctcg gcccttccgg ctggctggtt 
tgggtctcgc ggtatcattg cagcactggg 
tatctacacg acggggagtc aggcaactat 
aggtgcctca ctgattaagc attggtaact 
gattgattta aaacttcatt tttaatttaa 
tctcatgacc aaaatccctt aacgtgagtt 
aaagatcaaa ggatcttctt gagatccttt 
aaaaaaacca ccgctaccag cggtggtttg 



acaaaagcat tcctttcaaa tgaaccagac 1140 

agggggggat cctgttttgg 6ggatgagag 1200 
gaacgcagaa gcggtctgat aaaacagaat 1260 
cctgacccca tgccgaactc agaagtgaaa 1320 
ccccatgcga gagtagggaa ctgccaggca 1380 
ctgggccttt cgttttatct gttgtttgtc 1440 
gccgggagcg gatttgaacg ttgcgaagca 1500 
gccataaact gccaggcatc aaattaagca 1560 
gtttctacaa actcttggta ccgagacgat 1620 
ctgaagagta ggagccgaaa gtgacggcgt 1680 
cagaaatcgc ttctcttgat gtgcccggcg 1740 
tgccgtttga ggtaacagcg acaacgcctt 1800 
tcgcccagtc cgggttgttt aaagagtttc 1860 
cctgcactgc acgttccacg cccgcgatga 1920 
ctaacacacg ataagcaaga agtgtggcat 1980 
cagccacagt tccggctacg tgtgtgccat 2040 
ttggtgtttc ttttggatcg taatcattgt 2100 
agtttttctt cagatctggg tgattgtatt 2160 
tgccttttcc tgtgtagcct aaatcccatg 2220 
catccatttg cggagatacg gcgtcttcgg 2280 
ttatcatgca atagttaccc ttattatcaa 2340 
tcaaatcctt taaaaaaaca caaaagacca 2400 
taaagcaccc attagttcaa caaacgaaaa 2460 
tatttatgtt acagtaatat tgacttttaa 2520 
caagtaagcc tcctaaattc actttagata 2580 
aataaaattg atttagacaa ttggaagaga 2640 
caaacgactt ttagtataac cacagaaatt 2700 
caagaaggat ataaatttta ccctgcattt 2760 
aatacagctt ttagaactgg ttacaatagc 2820 
gagccacttt atacaatttt tgatggtgta 2880 
gtaaagaatg acttcaaaga gttttatgat 2940 
ggttcgggga aattgtttcc caaaacacct 3000 
attccatgga cttcatttac tgggtttaac 3060 
ctacccatta ttacagcagg aaaattcatt 3120 
tctttacagg tacatcattc tgtttgtgat 3180 
attcaggaat tgtcagatag gcctaatgac 3240 
gtacttttta cagtcggttt tctaatgtca 3300 
cggccgcaat tcttgaagac gaaagggcct 3360 
catgataata atggtttctt agacgtcagg 3420 
ccctatttgt ttatttttct aaatacattc 3480 
ctgataaatg cttcaataat attgaaaaag 3540 
cgcccttatt cccttttttg cggcattttg 3600 
ggtgaaagta aaagatgctg aagatcagtt 3660 
tctcaacagc ggtaagatcc ttgagagttt 3720 
cacttttaaa gttctgctat gtggcgcggt 3780 
actcggtcgc cgcatacact attctcagaa 3840 
aaagcatctt acggatggca tgacagtaag 3900 
tgataacact gcggccaact tacttctgac 3960 
ttttttgcac aacatggggg atcatgtaac 4020 
tgaagccata ccaaacgacg agcgtgacac 4080 
gcgcaaacta ttaactggcg aactacttac 4140 
gatggaggcg gataaagttg caggaccact 4200 
tattgctgat aaatctggag ccggtgagcg 4260 
gccagatggt aagccctccc gtatcgtagt 4320 
ggatgaacga aatagacaga tcgctgagat 4380 
gtcagaccaa gtttactcat atatacttta 4 440 
aaggatctag gtgaagatcc tttttgataa 4500 
ttcgttccac tgagcgtcag accccgtaga 4560 
ttttctgcgc gtaatctgct gcttgcaaac 4 620 
tttgccggat caagagctac caactctttt 4 680 
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tccgaaggta 

gtagttaggc 
cctgttacca 
acgatagtta 
cagcttggag 
cgccacgctt 
aggagagcgc 
gtttcgccac 
atggaaaaac 
tcacatgttc 
gtgagctgat 
agcggaagag 
catatggtgc 
ccgctatcgc 
gcgccctgac 
gggagctgca 
taaagctcat 
agctcgttga 
agggcggttt 
tgggggtaat 
aacatgcccg 
accagagaaa 
cacagggtag 
acttccgcgt 
aggtcgcaga 
tctgctaacc 
tcatgegcac 
agatggcgga 
gcaagaattg 
ttccattcag 
aaggtatagg 
ataaatcgcc 
cgatccttga 
cgcgggcatc 
tcgcg 



actggcttca 
caccacttca 
gtggctgctg 
ccggataagg 
cgaacgacct 
cccgaaggga 
acgagggagc 
ctctgacttg 
gccagcaacg 
tttcctgcgt 
accgctcgcc 
cgcctgatgc 
actctcagta 
tacgtgactg 
gggcttgtct 
tgtgtcagag 
cagcgtggtc 
gtttctccag 
tttcctgttt 
gataccgatg 
gttactggaa 
aatcactcag 
ccagcagcat 
ttccagactt 
cgttttgcag 
agtaaggcaa 
ccgtggccag 
cgcgatggat 
attggctcca 
gtcgaggtgg 
gcggcgccta 
gtgacgatca 
agctgtccct 
ccgatgccgc 



gcagagcgca 

agaactctgt 
ccagtggcga 
cgcagcggtc 
acaccgaact 
gaaaggcgga 
ttccaggggg 
agcgtcgatt 
cggccttttt 
tatcccctga 
gcagccgaac 
ggtattttct 
caatctgctc 
ggtcatggct 
gctcccggca 
gttttcaccg 
gtgaagcgat 
aagcgttaat 
ggtcacttga 
aaacgagaga 
cgttgtgagg 
ggtcaatgcc 
cctgcgatgc 
tacgaaacac 
cagcagtcgc 
ccccgccagc 
gacccaacgc 
atgttctgcc 
attcttggag 
cccggctcca 
caatccatgc 
gcggtccagt 
gatggtcgtc 
cggaagcgag 



gataccaaat 
agcaccgcct 
taagtcgtgt 
gggctgaacg 
gagataccta 
caggtatccg 
aaacgcctgg 
tttgtgatgc 
acggttcctg 
ttctgtggat 
gaccgagcgc 
ccttacgcat 
tgatgccgca 
gcgccccgac 
tccgcttaca 
tcatcaccga 
tcacagatgt 
gtctggcttc 
tgcctccgtg 
ggatgctcac 
gtaaacaact 
agcgcttcgt 
agatccggaa 
ggaaaccgaa 
ttcacgttcg 
ctagccgggt 
tgcccgagat 
aagggttggt 
tggtgaatcc 
tgcaccgcga 
caacccgttc 
gatcgaagtt 
atctacctgc 
aagaatcata 



actgtccttc 

acatacctcg 
cttaccgggt 
gggggttcgt 
cagcgtgagc 
gtaagcggca 
tatctttata 
tcgtcagggg 
gccttttgct 
aaccgtatta 
agcgagtcag 
ctgtgcggta 
tagttaagcc 
acccgccaac 
gacaagctgt 
aacgcgcgag 
ctgcctgttc 
tgataaagcg 
taagggggaa 
gatacgggtt 
ggcggtatgg 
taatacagat 
cataatggtg 
gaccattcat 
ctcgcgtatc 
cctcaacgac 
gcgccgcgtg 
ttgcgcattc 
gttagcgagg 
cgcaacgcgg 
catgtgctcg 
aggctggtaa 
ctggacagca 
atggggaagg 



tagtgtagcc 

ctctgctaat 
tggactcaag 
gcacacagcc 
tatgagaaag 
gggtcggaac 
gtcctgtcgg 
ggcggagcct 
ggccttttgc 
ccgcctttga 
tgagcgagga 
tttcacaccg 
agtatacact 
acccgctgac 
gaccgtctcc 
gcagctgcgg 
atccgcgtcc 
ggccatgtta 
tttctgttca 
actgatgatg 
atgcggcggg 
gtaggtgttc 
cagggcgctg 
gttgttgctc 
ggtgattcat 
aggagcacga 
cggctgctgg 
acagttctcc 
tgccgccggc 
ggaggcagac 
ccgaggcggc 
gagccgcgag 
tggcctgcaa 
ccatccagcc 



4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6725 



<210> 23 
<211> 6806 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: vector 
<220> 

<223> pAN637 



<400> 23 

tcggcggccg cttcgtcgac cgaaacagca gttataaggc atgaagctgt ccggtttttg 60 
caaaagtggc tgtgactgta aaaagaaatc gaaaaagacc gttttgtgtg aaaacggtct 120 
ttttgtttcc ttttaaccaa ctgccataac tcgaggccta cctagcttcc aagaaagata 180 
tcctaacagc acaagagcgg aaagatgttt tgttctacat ccagaacaac ctctgctaaa 240 
attcctgaaa aattttgcaa aaagttgttg actttatcta caaggtgtgg tataataatc 300 
ttaacaacag caggacgctc tagacaattg agatcttaag aaaggaggtg ttaattaatg 360 
aagattggaa tcattggcgg aggctccgtt ggtcttttat gcgcctatta tttgtcactt 420 
tatcacgacg tgactgttgt gacgaggcgg caagaacagg ctgcggccat tcagtctgaa 480 
ggaatccggc tttataaagg cggggaggaa ttcagggctg attgcagtgc ggacacgagt 540 
atcaattcgg actttgacct gcttgtcgtg acagtgaagc agcatcagct tcaatctgtt 600 
ttttcgtcgc ttgaacgaat cgggaagacg aatatattat ttttgcaaaa cggcatgggg 660 
catatccacg acctaaaaga ctggcacgtt ggccattcca tttatgttgg aatcgttgag 720 
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cacggagctg 
tggagcgcgt 
tcggattttc 
gtaaatgcgt 
acaacgccag 
aaacttgaaa 
gagaatcgtt 
attatcggat 
tttttatatg 
tcctgttttg 
agcggtctga 
atgccgaact 
agagtaggga 
tcgttttatc 
ggatttgaac 
tgccaggcat 
aactcttggt 
aggagccgaa 
cttctcttga 
aggtaacagc 
ccgggttgtt 
cacgttccac 
gataagcaag 
ttccggctac 
cttttggatc 
tcagatctgg 
ctgtgtagcc 
gcggagatac 
aatagttacc 
ttaaaaaaac 
cattagttca 
tacagtaata 
ctcctaaatt 
gatttagaca 
tttagtataa 
tataaatttt 
tttagaactg 
tatacaattt 
gacttcaaag 
aaattgtttc 
acttcattta 
attacagcag 
gtacatcatt 
ttgtcagata 
acagtcggtt 
ttcttgaaga 
aatggtttct 
tttatttttc 
gcttcaataa 
tccctttttt 
aaaagatgct 
cggtaagatc 
agttctgcta 
ccgcatacac 
tacggatggc 
tgcggccaac 
caacatgggg 
accaaacgac 
attaactggc 
ggataaagtt 



taagaaaatc 
tcgacgatgc 
cgatttatta 
gtattaatcc 
cttatctggc 
atgaagaaaa 
catcaatgct 
acttattgaa 
gcagcatcaa 
gcggatgaga 
taaaacagaa 
cagaagtgaa 
actgccaggc 
tgttgtttgt 
gttgcgaagc 
caaattaagc 
accgagacga 
agtgacggcg 
tgtgcccggc 
gacaacgcct 
taaagagttt 
gcccgcgatg 
aagtgtggca 
gtgtgtgcca 
gtaatcattg 
gtgattgtat 
taaatcccat 
ggcgtcttcg 
cttattatca 
acaaaagacc 
acaaacgaaa 
ttgactttta 
cactttagat 
attggaagag 
ccacagaaat 
accctgcatt 
gttacaatag 
ttgatggtgt 
agttttatga 
ccaaaacacc 
ctgggtttaa 
gaaaattcat 
ctgtttgtga 
ggcctaatga 
ttctaatgtc 
cgaaagggcc 
tagacgtcag 
taaatacatt 
tattgaaaaa 
gcggcatttt 
gaagatcagt 
cttgagagtt 
tgtggcgcgg 
tattctcaga 
atgacagtaa 
ttacttctga 
gatcatgtaa 
gagcgtgaca 
gaactactta 
gcaggaccac 



ggatacagct 
tgaaccagac 
tgagacggat 
tttaactgcg 
ttttatgaag 
ggcttgggag 
ggttgacgtc 
ggaagcaagt 
agcattggag 
gaagattttc 
tttgcctggc 
acgccgtagc 
atcaaataaa 
cggtgaacgc 
aacggcccgg 
agaaggccat 
tcgtcctctt 
tattcattga 
gatccgactg 
tctgacatgg 
ccgagagaca 
acgttttccg 
tcaggcgcta 
tggtcagttg 
tccacaaaat 
tcaaccccag 
gcatcgtttg 
gagattgtgg 
agataagaaa 
acatttttta 
attggataaa 
aaaaaggatt 
aaaaatttag 
aaaagagata 
tgatattagt 
tattttctta 
cgacggagag 
atctaaaaca 
tttatacctt 
tatacctgaa 
cttaaatatc 
taataaaggt 
tggttatcat 
ctggctttta 
actaacctgc 
tcgtgatacg 
gtggcacttt 
caaatatgta 
ggaagagtat 
gccttcctgt 
tgggtgcacg 
ttcgccccga 
tattatcccg 
atgacttggt 
gagaattatg 
caacgatcgg 
ctcgccttga 
ccacgatgcc 
ctctagcttc 
ttctgcgcto- 



gttgatcata 
cggctgaaca 
tggtaccgtc 
ttattgcaag 
ctggtatttc 
cgggttcagg 
attggaggcc 
cttcaaggtc 
cgaaatacca 
agcctgatac 
ggcagtagcg 
gccgatggta 
acgaaaggct 
tctcctgagt 
agggtggcgg 
cctgacggat 
tgttgtagcc 
gcggcagctg 
tccagccgtt 
cccagtcaag 
ggttcatcac 
ttgtgccgct 
cgcctttaat 
cctcgcccct 
cgtatccttt 
tgtcaataat 
ctccgatata 
ggaattctca 
gaaaaggatt 
atgtggtctt 
gtgggatatt 
gattctaatg 
gaggcatatc 
tttaatcatt 
gttttatacc 
gtgacaaggg 
ttaggttatt 
ttctctggta 
tctgatgtag 
aatgcttttt 
aataataata 
aattcaatat 
gcaggattgt 
taatatgaga 
cccgttagtt 
cctattttta 
tcggggaaat 
tccgctcatg 
gagtattcaa 
ttttgctcac 
agtgggttac 
agaacgtttt 
tattgacgcc 
tgagtactca 
cagtgctgcc 
aggaccgaag 
tcgttgggaa 
tgcagcaatg 
ccggcaacaa 
ggcccttccg 



caggcctagg 
tcttgtttca 
tgctgacggg 
tgaaaaatgg 
aggaggcatg 
ccgtttgtgg 
ggcagacgga 
ttgatgccgt 
acaaagtggt 
agattaaatc 
cggtggtccc 
gtgtggggtc 
cagtcgaaag 
aggacaaatc 
gcaggacgcc 
ggcctttttg 
catcactttt 
agtcgcaccg 
cggtccgctg 
cgctgtgctt 
atctgccccg 
tccgccaggc 
cgttccgttt 
cggatcgccg 
atattgtcca 
cgccaccttg 
aggcgcactg 
tgtttgacag 
tttcgctacg 
tattcttcaa 
tttaaaatat 
aagaaagcag 
aaatgaactt 
atttgaacca 
gaaacataaa 
tgataaactc 
gggataagtt 
tttggactcc 
agaaatataa 
ctctttctat 
gtaattacct 
atttaccgct 
ttatgaactc 
taatgccgac 
gaagaacgaa 
taggttaatg 
gtgcgcggaa 
agacaataac 
catttccgtg 
ccagaaacgc 
atcgaactgg 
ccaatgatga 
gggcaagagc 
ccagtcacag 
ataaccatga 
gagctaaccg 
ccggagctga 
gcaacaacgt 
ttaatagact 
gctggctggt 



tgcgataaaa 
gcataaccat 
caagctgatt 
agaactgctg 
ccgcatttta 
gcaaacgaaa 
agctgacgcc 
ccacctagag 
ttactaagga 
agaacgcaga 
acctgacccc 
tccccatgcg 
actgggcctt 
cgccgggagc 
cgccataaac 
cgtttctaca 
gctgaagagt 
acagaaatcg 
ttgccgtttg 
gtcgcccagt 
tcctgcactg 
cctaacacac 
gcagccacag 
gttggtgttt 
aagtttttct 
atgccttttc 
tcatccattt 
cttatcatgc 
ctcaaatcct 
ctaaagcacc 
atatttatgt 
acaagtaagc 
taataaaatt 
acaaacgact 
acaagaagga 
aaatacagct 
agagccactt 
tgtaaagaat 
tggttcgggg 
tattccatgg 
tctacccatt 
atctttacag 
tattcaggaa 
tgtacttttt 
gcggccgcaa 
tcatgataat 
cccctatttg 
cctgataaat 
tcgcccttat 
tggtgaaagt 
atctcaacag 
gcacttttaa 
aactcggtcg 
aaaagcatct 
gtgataacac 
cttttttgca 
atgaagccat 
tgcgcaaact 
ggatggaggc 
ttattgctga 



780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 
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taaatctgga gccggtgagc gtgggtctcg cggtatcatt gcagcactgg ggccagatgg 4380 

taagccctcc cgtatcgtag ttatctacac gacggggagt caggcaacta tggatgaacg 44 40 

aaatagacag atcgctgaga taggtgcctc actgattaag cattggtaac tgtcagacca 4500 

agtttactca tatatacttt agattgattt aaaacttcat ttttaattta aaaggatcta 4560 

ggtgaagatc ctttttgata atctcatgac caaaatccct taacgtgagt tttcgttcca 4 620 

ctgagcgtca gaccccgtag aaaagatcaa aggatcttct tgagatcctt tttttctgcg 4 680 

cgtaatctgc tgcttgcaaa caaaaaaacc accgctacca gcggtggttt gtttgccgga 4740 

tcaagagcta ccaactcttt ttccgaaggt aactggcttc agcagagcgc agataccaaa 4800 

tactgtcctt ctagtgtagc cgtagttagg ccaccacttc aagaactctg tagcaccgcc 4860 

tacatacctc gctctgctaa tcctgttacc agtggctgct gccagtggcg ataagtcgtg 4 920 

tcttaccggg ttggactcaa gacgatagtt accggataag gcgcagcggt cgggctgaac 4980 

ggggggttcg tgcacacagc ccagcttgga gcgaacgacc tacaccgaac tgagatacct 5040 

acagcgtgag ctatgagaaa gcgccacgct tcccgaaggg agaaaggcgg acaggtatcc 5100 

ggtaagcggc agggtcggaa caggagagcg cacgagggag cttccagggg gaaacgcctg 5160 

gtatctttat agtcctgtcg ggtttcgcca cctctgactt gagcgtcgat ttttgtgatg 5220 

ctcgtcaggg gggcggagcc tatggaaaaa cgccagcaac gcggcctttt tacggttcct 5280 

ggccttttgc tggccttttg ctcacatgtt ctttcctgcg ttatcccctg attctgtgga 5340 

taaccgtatt accgcctttg agtgagctga taccgctcgc cgcagccgaa cgaccgagcg 5400 

cagcgagtca gtgagcgagg aagcggaaga gcgcctgatg cggtattttc tccttacgca 54 60 

tctgtgcggt atttcacacc gcatatggtg cactctcagt acaatctgct ctgatgccgc 5520 

atagttaagc cagtatacac tccgctatcg ctacgtgact gggtcatggc tgcgccccga 5580 

cacccgccaa cacccgctga cgcgccctga cgggcttgtc tgctcccggc atccgcttac 5640 

agacaagctg tgaccgtctc cgggagctgc atgtgtcaga ggttttcacc gtcatcaccg 5700 

aaacgcgcga ggcagctgcg gtaaagctca tcagcgtggt cgtgaagcga ttcacagatg 5760 

tctgcctgtt catccgcgtc cagctcgttg agtttctcca gaagcgttaa tgtctggctt 5820 

ctgataaagc gggccatgtt aagggcggtt ttttcctgtt tggtcacttg atgcctccgt 5880 

gtaaggggga atttctgttc atgggggtaa tgataccgat gaaacgagag aggatgctca 5940 

cgatacgggt tactgatgat gaacatgccc ggttactgga acgttgtgag ggtaaacaac 6000 

tggcggtatg gatgcggcgg gaccagagaa aaatcactca gggtcaatgc cagcgcttcg 6060 

ttaatacaga tgtaggtgtt ccacagggta gccagcagca tcctgcgatg cagatccgga 6120 

acataatggt gcagggcgct gacttccgcg tttccagact ttacgaaaca cggaaaccga 6180 

agaccattca tgttgttgct caggtcgcag acgttttgca gcagcagtcg cttcacgttc 6240 

gctcgcgtat cggtgattca ttctgctaac cagtaaggca accccgccag cctagccggg 6300 

tcctcaacga caggagcacg atcatgcgca cccgtggcca ggacccaacg ctgcccgaga 6360 

tgcgccgcgt gcggctgctg gagatggcgg acgcgatgga tatgttctgc caagggttgg 6420 

tttgcgcatt cacagttctc cgcaagaatt gattggctcc aattcttgga gtggtgaatc 64 80 

cgttagcgag gtgccgccgg cttccattca ggtcgaggtg gcccggctcc atgcaccgcg 6540 

acgcaacgcg gggaggcaga caaggtatag ggcggcgcct acaatccatg ccaacccgtt 6600 

ccatgtgctc gccgaggcgg cataaatcgc cgtgacgatc agcggtccag tgatcgaagt 6660 

taggctggta agagccgcga gcgatccttg aagctgtccc tgatggtcgt catctacctg 6720 

cctggacagc atggcctgca acgcgggcat cccgatgccg ccggaagcga gaagaatcat 6780 

aatggggaag gccatccagc ctcgcg 6806 

<210> 24 
<211> 3867 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: vector 



<400> 24 

aagctttctc aagaagcgaa caagaaaaaa 

gtcgctagat tcagcgccaa tgcgtctaaa 

ctcgaaaaaa tcacgctgga tgatattaaa 

ttcacgccgg aacgggaaat cggaaatgat 

atcgatggcg taaaggtgct tgacaatgtc 

gctttcactg gccgaaatga acttgctgtt 

atggaagctg acagcggaac gtttaaatgg 

aaagacaaca gcgaatattt cgaaggcagt 



gaagagcaga ttaaacagct tcaagagttt 60 
tctaagcagg ctacatcaag aaagaaactt 120 
ccgtcttccc gccgctatcc ttatgttaac 180 
gttcttcgcg tggaaggctt aacaaaaaca 240 
agctttatta tgaatcgaga agataaaatt 300 
actacgctgt ttaaaatcat ttccggggaa 360 
ggtgttacca catctcaagc gtattttcca 420 
gatctgaacc ttgtagactg gcttcgccaa 480 
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tattctccgc acgaccaaag tgagagcttt 
tctggagaag aagtccacaa aaaagcaaat 
atgctgtcga aagcgatgct ttctggcgcc 
catttagacc tagagtccat tacagcgctc 
atgctgttta cttcccatga ccatcagttt 
attacaccta acggcatcgt cgataagcaa 
gatgtgcaga aaaaattgac tgaactatac 
ttttttgata cctaaatgtg aaaggagatc 
gtggagtagg cgggtatatt ggcggacggc 
tcgtgcgcca aaaacgagct gagcagctta 
aagggaatgt atcatttcag cccgaactaa 
tcgttatcat tgcttctaaa gcatactcgc 
ttatccatca agaatctgtc attatccctt 
tatttgcggc attttcaaaa gaacaggtgc 
tagacaacaa aggagaaatt catcatacga 
ggaacggcga gcgtacggag cggataagag 
ctgaagtcat cattagcggg catatcgaga 
atcaagataa gaaagaaaag gatttttcgc 
gaccacattt tttaatgtgg tctttattct 
gaaaattgga taaagtggga tatttttaaa 
tttaaaaaag gattgattct aatgaagaaa 
agataaaaat ttaggaggca tatcaaatga 
agagaaaaga gatatttaat cattatttga 
aaattgatat tagtgtttta taccgaaaca 
catttatttt cttagtgaca agggtgataa 
atagcgacgg agagttaggt tattgggata 
gtgtatctaa aacattctct ggtatttgga 
atgatttata cctttctgat gtagagaaat 
cacctatacc tgaaaatgct ttttctcttt 
ttaacttaaa tatcaataat aatagtaatt 
tcattaataa aggtaattca atatatttac 
gtgatggtta tcatgcagga ttgtttatga 
atgactggct tttataatat gagataatgc 
tgtcactaac ctgccccgtt agttgaagaa 
gggatcacta gtccaaacga cagaaggcga 
aaaacgttta tctctcgctg caccagtatt 
tgaagcagaa aaataaaaaa aggaggcgga 
aatttccgtt tctgaagata ggctctcttt 
tatccttaga accgatcata aagtccacgt 
caagctcttc acgagacatc tgctttccgc 
tcgccaaatg atttgacgcg ttttcatcaa 
gtgatatagg cgaatcgtaa ggaacaagtg 
ctgtttccac cagttctttt aaaatatcct 
ggccattttc aaacgtcagg gtgaaatttt 
ttgtgcttga taccactccg tcaaccccgt 
tcggcatatt ggccataaac tcatggccac 
catgcttcct aggcagctta attgttagat 
cgtaatgtct ctcgttcaaa tggtcaactt 
acgcctgaac cgggttgtct tcatttacgc 
ggtggatcgc ttcctcctct gatttgccag 
ccgcacctac gacagtccag ctgactttgt 
atgctttgcc tgctgctttt tggaatgccg 
agtctgggtt cgacgacaca acagaaatga 
tgccttttgc ttcccattca ggatattctt 
ttaatttggc gacttcgtca tcctgccaat 
atgcgtgttt tacaattaaa cggacaaaat 
catactggcc tttttggaca ttaacgc 



ttacgcggtt tcttaggacg catgctgttc 540 
gtactttccg ggggagaaaa ggtccgctgt 600 
aatattttaa ttttggatga gccgaccaac 660 
aataacggct taatcagctt taaaggcgct 720 
gtgcagacca ttgccaacag aattatagaa 780 
atgagctatg acgagttcct tgaaaatgct 840 
gccgaataaa aaagcagaga tttctctgct 900 
acaacatgaa atttttggtt gtcggagcag 960' 
tttcggagaa aggaaatgat gtgacatttc 1020 
aaaaaaccgg gcttgtcatc catagtgaaa 1080 
tcagtgccgg agaaacaggg caatttgatg 1140 
ttggtcaagt gatagaggat gtcaaaccat 1200 
ttttaaatgg gtaccgccac tatgagcagc 1260 
tgggcggcct gtgttttata gaaagtgctt 1320 
gcgcatcgca tcgttttgta tttggagaat 1380 
cgcttgaaga ggcattttca ggtgtgaagg 14 4 0 
agatcccctg cagcaatagt tacccttatt 1500 
tacgctcaaa tcctttaaaa aaacacaaaa 1560 
tcaactaaag cacccattag ttcaacaaac 1620 
atatatattt atgttacagt aatattgact 1680 
gcagacaagt aagcctccta aattcacttt 1740 
actttaataa aattgattta gacaattgga 1800 
accaacaaac gacttttagt ataaccacag 1860 
taaaacaaga aggatataaa ttttaccctg 1920 
actcaaatac agcttttaga actggttaca 198 0 
agttagagcc actttataca atttttgatg 2040 
ctcctgtaaa gaatgacttc aaagagtttt 2100 
ataatggttc ggggaaattg tttcccaaaa 2160 
ctattattcc atggacttca tttactgggt 2220 
accttctacc cattattaca gcaggaaaat 2280 
cgctatcttt acaggtacat cattctgttt 2340 
actctattca ggaattgtca gataggccta 2400 
cgactgtact ttttacagtc ggttttctaa 24 60 
ggtttttata ttacagctcc cgggagatct 2520 
ccacctgcat ggatttttga ttgaaaaagc 2580 
agaaaccgtt tatgcgaatc tgcaaatgta 2640 
aaagcctcct tttatttact taaaaagccc 2700 
tcccgtctgc cgtaattccg tcaatattca 2760 
gtgtaatgct ttcatttagg ccttctttga 2820 
cttcaatatt aaaggcatag gcgcttccga 2880 
acagcgtgtt atagaaaaga atgtttgatt 2940 
ccacttcacc taaatagtga gaaccttcat 3000 
cacctttttc agctttaatg tcgactatac 3060 
caataatatt tccgccgtag cttaatggtt 3120 
ctttttgcgg cagcgtgaac acttcttctg 3180 
tttcattcac gcttcccgca cctgcccaaa 3240 
cagttccttc tgcttgataa tgtaaggcag 3300 
tttcatgaag attttggtca tgattgatcc 3360 
gcgtcgcttt aaaaatttct tcccacagaa 3420 
gaaacacctt gtgagcccag cctgctgatg 3480 
ctgattgaat atattgtctg tatgtatgta 354 0 
caatccgttt ggaatctata ccttttagca 3600 
aagcagctcc atttttggca agctcttctc 3660 
caaatgcttc aaacggcgca agttcgtatt 3720 
tcacggtgac gttttttgcg cccttttcat 3780 
cccgaacgtc tgttgaagca tttacgacta 384 0 

3867 
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<210> 25 
<211> 8704 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: vector 
<220> 

<223> pAN238 



<400> 25 

gcggccgctt cgtcgaccga 
aagtggctgt gactgtaaaa 
tgtttccttt taaccaactg 
taacagcaca agagcggaaa 
cctgaaaaat tttgcaaaaa 
acaacagcag gacgctctag 
tggagtaggc gggtatattg 
cgtgcgccaa aaacgagctg 
agggaatgta tcatttcagc 
cgttatcatt gcttctaaag 
tatccatcaa gaatctgtca 
atttgcggca ttttcaaaag 
agacaacaaa ggagaaattc 
gaacggcgag cgtacggagc 
tgaagtcatc attagcgggc 
agcgcaagcg gggatcacaa 
agccggacgt cacacggccc 
aggtgttccg gctgatccgg 
ttaccatatg aagtcctcca 
ccacctgcat ggatttttga 
agaaaccgtt tatgcgaatc 
aaagcctcct tttatttact 
tcccgtctgc cgggatcctg 
aaatcagaac gcagaagcgg 
gtcccacctg accccatgcc 
gggtctcccc atgcgagagt 
gaaagactgg gcctttcgtt 
aaatccgccg ggagcggatt 
acgcccgcca taaactgcca 
ttttgcgttt ctacaaactc 
gcgaaatcga. agaagctgtc 
tcatctttaa aagtcacagc 
cagaaagata aaggaaatgc 
gttcatgcct caaaagaggt 
cttcctaatg agaaacggca 
cagaaagcta ttaaagcaac 
aaagcttggg cacttggata 
gtggaatgga atcatccggc 
aatgagcctg aaaatgaaat 
gatgatttgg ctcatggaac 
acaaatcaaa tcggtgtagc 
gatggcggca ctgatgctga 
gcggaaggaa atccccaccc 
ggctctggac ttgatgaatg 
ttccctgagt tttcagcggg 
gcaaatccgg caaactatcc 
ccatggagag aaaagaaaat 
tttaaaaagg ctgaaagagt 



aacagcagtt ataaggcatg 
agaaatcgaa aaagaccgtt 
ccataactcg aggcctacct 
gatgttttgt tctacatcca 
gttgttgact ttatctacaa 
aggaggagac taacatgaaa 
gcggacggct ttcggagaaa 
agcagcttaa aaaaaccggg 
ccgaactaat cagtgccgga 
catactcgct tggtcaagtg 
ttatcccttt tttaaatggg 
aacaggtgct gggcggcctg 
atcatacgag cgcatcgcat 
ggataagagc gcttgaagag 
atatcgagaa ggacatttgg 
cgttatttca acgaccgctt 
aaactcttat tggggaaatt 
ctcttgagga agagagcttt 
tgcttcggga tatggaaaac 
ttgaaaaagc aaaacgttta 
tgcaaatgta tgaagcagaa 
taaaaagccc aatttccgtt 
ttttggcgga tgagagaaga 
tctgataaaa cagaatttgc 
gaactcagaa gtgaaacgcc 
agggaactgc caggcatcaa 
ttatctgttg tttgtcggtg 
tgaacgttgc gaagcaacgg 
ggcatcaaat taagcagaag 
ttggtaccca gaaaaagcgg 
tgccgctaag acggaatatc 
cgatgaatcc cagcaagatg 
agaccaaatt cattcttatt 
tatggaaaaa gtggtgcagt 
gctttttaag tcatcctccc 
tgacggtgtg gaatggaatg 
tgatggaact ggcacggttg 
attaaaagag aaatatcgcg 
gaactggtat gatgccgtag 
ccacgtgaca ggcacgatgg 
acctggcgca aaatggattg 
cattttggaa gctggtgaat 
ggaaatggct cctgatgttg 
gtacagagac atggtcaatg 
gaatacggat ctctttattc 
agaatcgttt gcaactggag 
cgctaatgtt gattactttg 
aaaagattgt gctgaaatat 



aagctgtccg gtttttgcaa 60 
ttgtgtgaaa acggtctttt 120 
agcttccaag aaagatatcc 180 
gaacaacctc tgctaaaatt 240 
ggtgtggtat aataatctta 300 
tttttggttg tcggagcagg 360 
ggaaatgatg tgacatttct 420 
cttgtcatcc atagtgaaaa 480 
gaaacagggc aatttgatgt 540 
atagaggatg tcaaaccatt 600 
taccgccact atgagcagct 660 
tgttttatag aaagtgcttt 720 
cgttttgtat ttggagaatg 780 
gcattttcag gtgtgaaggc 840 
aaaaagtatc tctttattgc 900 
ggcccaatcc tcgccacaga 960 
tgcactgttt tacgaaaaga 1020 
cgtacgatga ccagcatgtc 1080 
ggccaaacga cagaaggcga 114 0 
tctctcgctg caccagtatt 1200 
aaataaaaaa aggaggcgga 1260 
tctgaagata ggctctcttt 1320 
ttttcagcct gatacagatt 1380 
ctggcggcag tagcgcggtg 14 40 
gtagcgccga tggtagtgtg 1500 
ataaaacgaa aggctcagtc 1560 
aacgctctcc tgagtaggac 1620 
cccggagggt ggcgggcagg 1680 
gccatcctga cggatggcct 1740 
caaaagcggc tgttaaaaaa 1800 
aaaagcgttc tgctgttgtg 1860 
tcctaaaata cttgaacacc 1920 
atgtggtgaa cgggattgct 1980 
ttcccgaagt ggaaaaggtg 2040 
catttaatat gaaaaaagca 2100 
tagaccaaat cgatgcccca 2160 
ttgcgtccat tgataccggg 2220 
gatataatcc ggaaaatcct 2280 
caggcgaggc aagcccttat 2340 
tgggctctga acctgatgga 2400 
ctgttaaagc gttctctgaa 2460 
gggttttagc accaaaggac 2520 
tcaataactc atggggaggg 2580 
cctggcgttc ggccgatatt 2640 
ccggcgggcc tggttctatc 2700 
cgactgagaa ttccaattcc 2760 
aacttctgca tattcttgaa 2820 
tagagtataa acaaaatcgt 2880 
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gaaacaggcg aaagaaagtt gtatcgagtg tggttttgta aatccaggct ttgtccaatg 2940 
tgcaactgga ggagagcaat gaaacatggc attcagtcac aaaaggttgt tgctgaagtt 3000 
attaaacaaa agccaacagt tcgttggttg tttctcacat taacagttaa aaatgtttat 3060 
gatggcgaag aattaaataa gagtttgtca gatatggctc aaggatttcg ccgaatgatg 3120 
caatataaaa aaattaataa aaatcttgtt ggttttatgc gtgcaacgga agtgacaata 3180 
aataataaag ataattctta taatcagcac atgcatgtat tggtatgtgt ggaaccaact 3240 
tattttaaga atacagaaaa ctacgtgaat caaaaacaat ggattcaatt ttggaaaaag 3300 
gcaatgaaat tagactatga tccaaatgta aaagttcaaa tgattcgacc gaaaaataaa 3360 
tataaatcgg atatacaatc ggcaattgac gaaactgcaa aatatcctgt aaaggatacg 3420 
gattttatga ccgatgatga agaaaagaat ttgaaacgtt tgtctgattt ggaggaaggt 3480 
ttacaccgta aaaggttaat ctcctatggt ggtttgttaa aagaaataca taaaaaatta 3540 
aaccttgatg acacagaaga aggcgatttg attcatacag atgatgacga aaaagccgat 3600 
gaagatggat tttctattat tgcaatgtgg aattgggaac ggaaaaatta ttttattaaa 3660 
gagtagttca acaaacgggc catattgttg tataagtgat gaaatactga atttaaaact 3720 
tagtttatat gtggtaaaat gttttaatca agtttaggag gaattaatta tgaagtgtaa 3780 
tgaatgtaac agggttcaat taaaagaggg aagcgtatca ttaaccctat aaactacgtc 3840 
tgccctcatt attggagggt gaaatgtgaa tacatcctat tcacaatcga atttacgaca 3900 
caaccaaatt ttaatttggc tttgcatttt atcttttttt agcgtattaa atgaaatggt 3960 
tttgaacgtc tcattacctg atattgcaaa tgattttaat aaaccacctg cgagtacaaa 4020 
ctgggtgaac acagccttta tgttaacctt ttccattgga acagctgtat atggaaagct 4080 
atctgatcaa ttaggcatca aaaggttact cctatttgga attataataa attgtttcgg 4140 
gtcggtaatt gggtttgttg gccattcttt cttttcctta cttattatgg ctcgttttat 4200 
tcaaggggct ggtgcagctg catttccagc actcgtaatg gttgtagttg cgcgctatat 4260 
tccaaaggaa aataggggta aagcatttgg tcttattgga tcgatagtag ccatgggaga 4320 
aggagtcggt ccagcgattg gtggaatgat agcccattat attcattggt cctatcttct 4 380 
actcattcct atgataacaa ttatcactgt tccgtttctt atgaaattat taaagaaaga 4 4 40 
agtaaggata aaaggtcatt ttgatatcaa aggaattata ctaatgtctg taggcattgt 4500 
attttttatg ttgtttacaa catcatatag catttctttt cttatcgtta gcgtgctgtc 4560 
attcctgata tttgtaaaac atatcaggaa agtaacagat ccttttgttg atcccggatt 4620 
agggaaaaat atacctttta tgattggagt tctttgtggg ggaattatat ttggaacagt 4 680 
agcagggttt gtctctatgg ttccttatat gatgaaagat gttcaccagc taagtactgc 4740 
cgaaatcgga agtgtaatta ttttccctgg aacaatgagt gtcattattt tcggctacat 4800 
tggtgggata cttgttgata gaagaggtcc tttatacgtg ttaaacatcg gagttacatt 48 60 
tctttctgtt agctttttaa ctgcttcctt tcttttagaa acaacatcat ggttcatgac 4 920 
aattataatc gtatttgttt taggtgggct ttcgttcacc aaaacagtta tatcaacaat 4980 
tgtttcaagt agcttgaaac agcaggaagc tggtgctgga atgagtttgc ttaactttac 5040 
cagcttttta tcagagggaa caggtattgc aattgtaggt ggtttattat ccataccctt 5100 
acttgatcaa aggttgttac ctatggaagt tgatcagtca acttatctgt atagtaattt 5160 
gttattactt ttttcaggaa tcattgtcat tagttggctg gttaccttga atgtatataa 5220 
acattctcaa agggatttct aaatcgttaa gggatcaact ttgggagaga gttcaaaatt 5280 
gatccttttt ttataacagt tcgaagcggc cgcaattctt gaagacgaaa gggcctcgtg 5340 
atacgcctat ttttataggt taatgtcatg ataataatgg tttcttagac gtcaggtggc 5400 
acttttcggg gaaatgtgcg cggaacccct atttgtttat ttttctaaat acattcaaat 54 60 
atgtatccgc tcatgagaca ataaccctga taaatgcttc aataatattg aaaaaggaag 5520 
agtatgagta ttcaacattt ccgtgtcgcc cttattccct tttttgcggc attttgcctt 5580 
cctgtttttg ctcacccaga aacgctggtg aaagtaaaag atgctgaaga tcagttgggt 5640 
gcacgagtgg gttacatcga actggatctc aacagcggta agatccttga gagttttcgc 5700 
cccgaagaac gttttccaat gatgagcact tttaaagttc tgctatgtgg cgcggtatta 57 60 
tcccgtattg acgccgggca agagcaactc ggtcgccgca tacactattc tcagaatgac 5820 
ttggttgagt actcaccagt cacagaaaag catcttacgg atggcatgac agtaagagaa 5880 
ttatgcagtg ctgccataac catgagtgat aacactgcgg ccaacttact tctgacaacg 5940 
atcggaggac cgaaggagct aaccgctttt ttgcacaaca tgggggatca tgtaactcgc 6000 
cttgatcgtt gggaaccgga gctgaatgaa gccataccaa acgacgagcg tgacaccacg 6060 
atgcctgcag caatggcaac aacgttgcgc aaactattaa ctggcgaact acttactcta 6120 
gcttcccggc aacaattaat agactggatg gaggcggata aagttgcagg accacttctg 6180 
cgctcggccc ttccggctgg ctggtttatt gctgataaat ctggagccgg tgagcgtggg 6240 
tctcgcggta tcattgcagc actggggcca gatggtaagc cctcccgtat cgtagttatc 6300 
tacacgacgg ggagtcaggc aactatggat gaacgaaata gacagatcgc tgagataggt 6360 
gcctcactga ttaagcattg gtaactgtca gaccaagttt actcatatat actttagatt 6420 
gatttaaaac ttcattttta atttaaaagg atctaggtga agatcctttt tgataatctc 6480 
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atgaccaaaa 

atcaaaggat 
aaaccaccgc 
aaggtaactg 
ttaggccacc 
ttaccagtgg 
tagttaccgg 
ttggagcgaa 
acgcttcccg 
gagcgcacga 
cgccacctct 
aaaaacgcca 
atgttctttc 
gctgataccg 
gaagagcgcc 
tggtgcactc 
tatcgctacg 
cctgacgggc 
gctgcatgtg 
gctcatcagc 
cgttgagttt 
cggttttttc 
ggtaatgata 
tgcccggtta 
gagaaaaatc 
gggtagccag 
ccgcgtttcc 
cgcagacgtt 
ctaaccagta 
gcgcacccgt 
ggcggacgcg 
gaattgattg 
attcaggtcg 
tatagggcgg 
atcgccgtga 
ccttgaagct 
ggcatcccga 
gtcg 



tcccttaacg 
cttcttgaga 
taccagcggt 
gcttcagcag 
acttcaagaa 
ctgctgccag 
ataaggcgca 
cgacctacac 
aagggagaaa 
gggagcttcc 
gacttgagcg 
gcaacgcggc 
ctgcgttatc 
ctcgccgcag 
tgatgcggta 
tcagtacaat 
tgactgggtc 
ttgtctgctc 
tcagaggttt 
gtggtcgtga 
ctccagaagc 
ctgtttggtc 
ccgatgaaac 
ctggaacgtt 
actcagggtc 
cagcatcctg 
agactttacg 
ttgcagcagc 
aggcaacccc 
ggccaggacc 
atggatatgt 
gctccaattc 
aggtggcccg 
cgcctacaat 
cgatcagcgg 
gtccctgatg 
tgccgccgga 



tgagttttcg 
tccttttttt 
ggtttgtttg 
agcgcagata 
ctctgtagca 
tggcgataag 
gcggtcgggc 
cgaactgaga 
ggcggacagg 
agggggaaac 
tcgatttttg 
ctttttacgg 
ccctgattct 
ccgaacgacc 
ttttctcctt 
ctgctctgat 
atggctgcgc 
ccggcatccg 
tcaccgtcat 
agcgattcac 
gttaatgtct 
acttgatgcc 
gagagaggat 
gtgagggtaa 
aatgccagcg 
cgatgcagat 
aaacacggaa 
agtcgcttca 
gccagcctag 
caacgctgcc 
tctgccaagg 
ttggagtggt 
gctccatgca 
ccatgccaac 
tccagtgatc 
gtcgtcatct 
agcgagaaga 



ttccactgag 
ctgcgcgtaa 
ccggatcaag 
ccaaatactg 
ccgcctacat 
tcgtgtctta 
tgaacggggg 
tacctacagc 
tatccggtaa 
gcctggtatc 
tgatgctcgt 
ttcctggcct 
gtggataacc 
gagcgcagcg 
acgcatctgt 
gccgcatagt 
cccgacaccc 
cttacagaca 
caccgaaacg 
agatgtctgc 
ggcttctgat 
tccgtgtaag 
gctcacgata 
acaactggcg 
cttcgttaat 
ccggaacata 
accgaagacc 
cgttcgctcg 
ccgggtcctc 
cgagatgcgc 
gttggtttgc 
gaatccgtta 
ccgcgacgca 
ccgttccatg 
gaagttaggc 
acctgcctgg 
atcataatgg 



cgtcagaccc 
tctgctgctt 
agctaccaac 
tccttctagt 
acctcgctct 
ccgggttgga 
gttcgtgcac 
gtgagctatg 
gcggcagggt 
tttatagtcc 
caggggggcg 
tttgctggcc 
gtattaccgc 
agtcagtgag 
gcggtatttc 
taagccagta 
gccaacaccc 
agctgtgacc 
cgcgaggcag 
ctgttcatcc 
aaagcgggcc 
ggggaatttc 
cgggttactg 
gtatggatgc 
acagatgtag 
atggtgcagg 
attcatgttg 
cgtatcggtg 
aacgacagga 
cgcgtgcggc 
gcattcacag 
gcgaggtgcc 
acgcggggag 
tgctcgccga 
tggtaagagc 
acagcatggc 
ggaaggccat 



cgtagaaaag 
gcaaacaaaa 
tctttttccg 
gtagccgtag 
gctaatcctg 
ctcaagacga 
acagcccagc 
agaaagcgcc 
cggaacagga 
tgtcgggttt 
gagcctatgg 
ttttgctcac 
ctttgagtga 
cgaggaagcg 
acaccgcata 
tacactccgc 
gctgacgcgc 
gtctccggga 
ctgcggtaaa 
gcgtccagct 
atgttaaggg 
tgttcatggg 
atgatgaaca 
ggcgggacca 
gtgttccaca 
gcgctgactt 
ttgctcaggt 
attcattctg 
gcacgatcat 
tgctggagat 
ttctccgcaa 
gccggcttcc 
gcagacaagg 
ggcggcataa 
cgcgagcgat 
ctgcaacgcg 
ccagcctcgc 



6540 

6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
7980 
8040 
8100 
8160 
8220 
8280 
8340 
8400 
8460 
8520 
8580 
8640 
8700 
8704 
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(57) Abstract: The present invention features methods of producting 3-(2-hydroxy-3-methyl-butyiylamino)-propionic acid 
Q ("HMBPA") and a-hydroxyisovalerate ("a-HTV") utilizing microorganisms having modified pantothenate biosynthetic enzyme 

activities. Recombinant microorganisms and conditions for culturing same are also featured. Also featured are compositions 
^ including HMBPA and compositions including a-HIV. 
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METHODS AND MICROORGANISMS FOR THE PRODUCTION OF 3-(2- 
im)ROXY-3-METHyL-BUTYRYLAMINO)-PROPIONIC ACID (HMBPA) 

Related Applications 

5 The present invention claims the benefit of prior-filed provisional Patent 

Application Serial No. 60/263,053, filed January 19, 2001 (pending). The present 
invention is also related to U.S. Patent Application Serial No. 09/667,569, filed 
September 21, 2000 (pending), which is a continuation-in-part of U.S. Patent 
Application Serial No. 09/400,494, filed September 21, 1999 (abandoned). U.S. Patent 

10 Application Serial No. 09/667,569 also claims the benefit of prior-filed provisional 
Patent Application Serial No. 60/210,072, filed June 7, 2000, provisional Patent 
Application Serial No. 60/221,836, filed July 28, 2000, and provisional Patent 
Application Serial No. 60/227,860, filed August 24, 2000. The entire content of each of 
the above-referenced applications is incorporated herein by this reference. 

15 

Background of the Invention 

Conventional means of synthesizing chemical compounds is via synthesis 
from bulk chemicals, a process which is limited by factors such as substrate availability 
and/or cost, difficulty in resolving complex mixtures of products, complexities in 

20 synthesizing large quantities of compounds in purified form, and difficulty in producing 
chiral compounds. Accordingly, researchers have recently looked to bacterial or 
microbial systems that express enzymes usefiil for various biosynthetic processes, for 
example, in the synthesis of pharmaceutical compounds, research reagents, 
nutriceuticals, vitamins, nutritional supplements, antibiotic compounds and the like. In 

25 particular, bioconversion processes have been evaluated as a means of favoring 

production of preferred compounds and recently methods of direct microbial synthesis 
have been the focus of much research in the areas of pharmaceuticals and agriculture. 

Summary of the Invention 

30 The present invention relates to a processes for the dkect microbial 

syntiiesis of [R]-3-(2-hydroxy-3-methyl-butyrylamino)-propionic acid or 3-(2-hydroxy- 
3-methyl-butyrylamino)-propionic acid ("HMBPA"), referred to interchangeably herein 
as "(3-alanine 2-(/?)-hydroxyisolvalerate", "P-alanine 2-hydroxyisolvalerate", "P-alanyl- 
a-hydroxyisovalarate", N-(2-hydroxy-3-methyl-l-oxobutyl)-p-alanine ("HMOBA") 
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and/or "fantothenate". In particular, it has been discovered that in microorganisms 
engineered to overexpress certain en2ymes conventionally associated with pantothenate 
and/or isoleucine-valine (/7v) biosynthesis, an alternative biosynthetic pathway is present 
that competes for key precursors of pantothenate biosynthesis, namely a-ketoisovalerate 
5 (a-KIV) and jJ-alanine. a-KTV is converted to a-hydroxyisovalerate (a-HIV) catalyzed 
by various reductase enzymes and a-HIV is subsequently condensed with p -alanine to 
produce HMBPA. 

In one embodiment, the invention features a process for the production of 
3-(2-hydroxy-3-methyl'-butyrylamino)-propionic acid (HMBPA) that includes culturing 

10 a microorganism having increased keto reductase activity or increased pantothenate 
synthetase activity in the presence of excess a-ketoisovalerate and excess p-alanine, 
such that HMBPA is produced. In another embodiment, the invemtion features a 
process for the production of 3-(2-hydroxy-3-methyl-butyrylamino)-propionic acid 
(HMBPA) that includes culturing a microorganism having increased keto reductase 

1 5 activity and increased pantothenate synthetase activity in the presence of excess a- 
ketoiso valerate and excess P-alanine, such that HMBPA is produced. In one 
embodiment, the microorganism has a modified panE gene, for example, a modified 
panEl gene and/or a modified panE2 gene (^.g., the panE gene is overexpressed, 
deregulated or present in multiple copies). In another embodiment, the microorganism 

20 has a modified panC gene {e.g. , the panC gene is overexpressed, deregulated or present 
in multiple copies). In another embodiment, the microorganism further has increased 
acetohydroxyacid isomeroreductase activity. In another embodiment, the 
microorganism is cultured under conditions of increased acetohydroxyacid 
isomeroreductase activity in the presence of excess a-ketoisovalerate and excess p- 

25 alanine, such that HMBPA is produced. In yet another embodiment, the microorganism 
comprises a modified ilvC gene (e.g., the ilvC gene is overexpressed, deregulated or 
present in multiple copies). In yet anotiter embodiment, the microorganism fiirther has 
reduced ketopantoate hydroxymethyltransferase activity (e.g., has a modified panB gene, 
for example a panB gene that has been deleted. 

30 In another aspect, the uivention features a process for the production of 3- 

(2-hydroxy-3-methyl-butyrylamino)-propionic acid (HMBPA) that includes culturing a 
microorganism having reduced ketopantoate hydroxymethyltransferase activity in the 
presence of excess a-ketoisovalerate and excess p-alanine, such that HMBPA is 
produced. In another aspect, the invention features a method for enhancing production 
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of 3-(2-hydroxy-3-methyl-butyrylainino)-propionic acid (HMBPA) relative to 
pantothenate that includes culturing a recombinant microorganism under conditions such 
that the HMBPA production is enhanced relative to pantothenate production. In another 
aspect, the invention features a process for the production of 2-hydroxy isovaleric acid 
5 (a-HIV) that includes culturing a microorganism which overexpresses PanEl or PanE2 
and which further has reduced PanC or PanD activity under conditions such that a-HIV 
is produced. In another aspect, the invention features a process for the production of 3- 
(2-hydroxy-3-methyl-butyrylamino)-propionic acid (HMBPA) that includes culturing a 
recombinant microorganism having decreased expression or activity of serA or glyA 

10 under conditions such that HMBPA is produced. In another aspect, the invention 

features a process for the production of 3-(2-hydroxy-3-methyl-butyrylamino)-propionic 
acid (HMBPA) that includes culturing a recombinant microorganism having decreased 
expression or activity of serA and glyA under conditions such that HMBPA is produced. 
Conditions for culturing the above described microorganisms include, for example, 

15 conditions of increased steady state glucose, conditions of decreased steady state 
dissolved oxygen, and/or cultured under conditions of decreased serine. Products - 
produced according to the above described processes and/or methods are also featured. 
Also featured are recombinant microorganisms utilized in the above-described methods. 

Compounds produced according to the methodologies of the present 

20 invention have a variety of uses. For example, HMBPA can be used to synthesize 
. inhibitors of HMG CoA Reductase (II) (Gordon et al Bio, Med Chem, Lett 1(3):161 
(1991). Inhibitors of HMG CoA Reductase (II) have been studied for use as in the 
treatment of hypercholesterolaemia and coronary atherosclerosis progression. Inhibitors 
of HMG CoA Reductase also have been used to reduce risk of cardiovascular events in 

25 patients at risk. Moreover, the HMBPA precursor 2-hydroxyisovalerate (a-HIV) has • 
been demonstrated to have nutriceutical properties, for example, in the prevention of 
agmg of the skin. In particular, a-hydroxy acids, such as a-HIV (or 2-hydroxyvaline), 
can be used to synthesize a-hydroxy esters which have been found to induce increased 
skin thickness by increasing biosyntheses of glycosaminoglycans, proteoglycans, 

30 collagen, elastin, and other dermal components. The compounds can be used to treat 
skin disorders such as age spots, skin hues, wrinkles, photoaging and aging. 

Other features and advantages of the invention will be apparent from the 
following detailed description and claims. 
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Brief Description of the Drawings 

Figure 7 is a schematic representation of the pantothenate and isoleucine- 
valine (ilv) biosynthetic pathways. Pantothenate biosynthetic enzymes are depicted in 
bold and their corresponding genes indicated in italics. Isoleucine-valine (ilv) 
5 biosynthetic enzymes are depicted in bold italics and their corresponding genes indicated 
in italics. 

Figure 2 is a schematic representation of the biosynthetic pathway 
leading to [R]-3-(2-hydroxy-3-methyl-butyrylamino)-propionic acid ("HMBPA") in B, 
subtilis. 

10 Figure i is a schematic depiction of the structure of [R]-3-(2-hydroxy-3- 

methyl-butyrylamino)-propionic acid ("HMBPA"). 

Figure 4 is a HPLC chromatogram of a sample of medium from a 1 4 L 
fermentation of PA824. 

Figure 5 is a mass spectrum depicting the relative monoisotopic mass of 
1 5 [R]-3-(2-hydroxy-3 -methyl-butyrylamino)-propionic acid. 

, Figure 6 depicts an alignment of the C-terminal amino acids from known 

or suspected PanB proteins. 

Figure 7 is a schematic representation of the construction of the plasmid 

pAN624. 

20 Figure 5 is a schematic representation of the construction of the plasmid 

pAN620. 

Figure P is a schematic representation of the construction of the plasmid 

pAN636. 

Figure 70 is a schematic representation of the construction of the plasmid 
25 pAN637 which allows selection for single or multiple copies using chloramphenicol. 

Figure ii is a schematic representation of the construction of the plasmid 
pAN238, a plasmid for overexpressing B. subtilis panE2 from the P26 promoter. 

Detailed Description of the Invention 

30 The present invention is based, at least in part, on the discovery of a novel 

biosynthetic pathway in bacteria, namely the [R]-3-(2-hydroxy-3-methyl-butyrylamino)- 
propionic acid ("HMBPA") biosynthetic pathway. In particular, it has been discovered 
that bacteria are capable of generating HMBPA from a-ketoisovalerate (a-KIV), a key 
product of the isoleucine-valine (/7v) biosynthetic pathway and precursor of the 
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pantothenate biosynthetic pathway. Production of HMBPA in bacteria involves at least 
the pantothenate biosynthetic enzymes ketopantoate reductase (the panEl gene product) 
and/or acetohydroxyacid isomeroreductase (the ilvC gene product) and results from the 
condensation of 2-hydroxyisovaleric acid (a-HIV), formed by reduction of a-KIV, and 

5 p-alanine, the latter reaction being catalyzed by the pantothenate biosynthetic enzyme 
pantothenate synthetase (the panC gene product). Production of HMBPA is achieved by 
increasing ketopantoate reductase (e.g., PanEl) and/or PanE2 and/or acetohydroxyacid 
isomeroreductase activities (e.g., IlvC) in microorganisms, for example, by 
overexpressing or deregulating the genes encoding said enzymes. Optimal production of 

1 0 HMBPA is achieved by decreasing or deleting ketopantoate hydroxymethy [transferase 
activity (the panB gene product) in microorganisms, for example, by modifying or 
deleting the panB gene which encodes ketopantoate hydroxymethyltransferase (e.g., 
PanB), optionally in addition to increasing ketopantoate reductase and/or PanE2 and/or 
acetohydroxyacid isomeroreductase activities in said microorganisms. The substrates a- 

1 5 KIV and p-alanine are required for HMBPA production, the latter provided, for 

example, by p-alanine feeding and/or increased aspartate-a-decarboxylate activity (the 
panD gene product). Increasing substrate concentration (/.e., a-KIV and/or p-alanine) 
further enhances production of HMBPA. a-KIV production can be increased by 
overexpressing ilvBNCD genes and/or alsS. HMBPA production can further be • 

20 increased by limiting serine availability or synthesis in appropriately engineered 
microorganisms. 

In order that the present invention may be more readily understood, 
25 certain terms are first defined herein. 

The term "pantothenate biosynthetic pathway" includes the biosynthetic 
pathway involving pantothenate biosynthetic enzymes (e.g., polypeptides encoded by 
biosynthetic enzyme-encoding genes), compounds (e.g., precursors, substrates, 
intermediates or products), cofactors and the like utilized in the formation or synthesis of 
30 pantothenate. The term ^'pantothenate biosynthetic pathway" includes the biosynthetic 
pathway leading to the synthesis of pantothenate in microorganisms (e.g., in vivo) as 
well as the biosynthetic pathway leading to the synthesis of pantothenate in vitro. 
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The term "pantothenate biosynthetic enzyme" includes any enzyme 
utilized in the formation of a compound (e.g. , intermediate or product) of the 
pantothenate biosynthetic pathway. For example, synthesis of pantoate from a- 
ketoisovalerate (a-KIV) proceeds via the intermediate, ketopantoate. Formation of 
5 ketopantoate is catalyzed by the pantothenate biosynthetic enzyme ketopantoate 

hydroxymethyltransferase (the panB gene product). Formation of pantoate is catalyzed 
by the pantothenate biosynthetic enzyme ketopantoate reductase (the panE gene 
product). Synthesis of p-alanine from aspartate is catalyzed by the pantothenate 
biosynthetic enzyme aspartate-a-decarboxylase (the panD gene product). Formation of 

10 pantothenate from pantoate and p-alanine (e.g,, condensation) is catalyzed by the 
pantothenate biosynthetic enzyme pantothenate synthetase (the panC gene product). 
Based on the newly discovered HMBPA biosynthesis pathway, pantothenate 
biosynthetic enzymes may also perform an alternative function as enzymes in the 
HMBPA biosynthetic pathway described herein. 

15 The term "pantothenate" includes the free acid form of pantothenate, also 

referred to as "pantothenic acid" as well as any salt thereof (e.g., derived by replacing 
the acidic hydrogen of pantothenate or pantothenic acid with a cation, for example, 
calcium, sodium, potassium, ammonium), also referred to as a "pantothenate salt". The 
term "pantothenate" also includes alcohol derivatives of pantothenate. Preferred 

20 pantothenate salts are calcium pantothenate or sodium pantothenate. A preferred alcohol 
derivative is pantothenol. Pantothenate salts and/or alcohols of the present invention 
include salts and/or alcohols prepared via conventional methods from the free acids 
described herein. In another embodiment, calcium pantothenate is synthesized directly 
by a microorganism of the present invention. A pantothenate salt of the present 

25 invention can likewise be converted to a free acid form of pantothenate or pantothenic 
acid by conventional methodology. 

The term "isoleucine-valine biosynthetic pathway" includes the 
biosjoithetic pathway involving isoleucine-valine biosynthetic enzymes (e.g. , 
polypeptides encoded by biosynthetic enzyme-encoding genes), compounds (e.g., 

30 precursors, substrates, intermediates or products), cofactors and the like utilized in the 
formation or synthesis of conversion of pyruvate to valine or isoleucine. The term 
"isoleucine-valine biosynthetic pathway" includes the biosynthetic pathway leading to 
the synthesis of valine or isoleucine in microorganisms (e.g., in vivo) as well as the 
biosynthetic pathway leading to the synthesis of valine or isoleucine in vitro. Figure 1 
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includes a schematic representation of the isoleucine-valine biosynthetic pathway. 
Isoleucine-valine biosynthetic enzymes are depicted in bold italics and their 
corresponding genes indicated in italics 

The term "isoleucine-valine biosynthetic enzyme'' includes any enzyme 
5 utilized in the formation of a compound {e.g., intermediate or product) of the isoleucine- 
valine biosynthetic pathway. According to Figure 1, synthesis of valine from pyruvate 
proceeds via the intermediates, acetolactate, a,P-dihydroxyisovalerate (a,P-DHIV) and 
a-ketoisovalerate (a-KIV). Formation of acetolactate from pyruvate is catalyzed by the 
. isoleucine-valine biosynthetic enzyme acetohydroxyacid synthetase (the UvBNgQtiQ 

10 product, or altematively, the alsS gene product). Formation of a,p-DHIV from 
acetolactate is catalyzed by the isoleucine-valine biosynthetic enzyme 
acetohydroxyacidisomero reductase (the ilvC gene product). Synthesis of a-KIV from 
a,p-DHIV is catalyzed by the isoleucine-valine biosynthetic enzyme dihydroxyacid 
dehydratase (the ilvD gene product). Moreover, valine and isoleucine can be 

15 interconverted with their respective a-keto compounds by branched chain amino acid 
transaminases. Based on the newly discovered HMBPA biosynthesis pathway, 
isoleucine-valine biosynthetic enzymes may also perform an altemative fimction as 
enzymes in the HMBPA biosynthetic pathway described herein. 

The term "3-(2-hydroxy-3-methyl-butyrylamino)-propionic acid 

20 ("HMBPA") biosynthetic pathway" includes the altemative biosynthetic pathway 
involving biosynthetic enzymes and compounds (e,g, , substrates and the like) 
traditionally associated with the pantothenate biosynthetic pathway utilized in the 
formation or synthesis of HMBPA. The temi "HMBPA biosynthetic pathway" includes 
the biosynthetic pathway leading to the synthesis of HMBPA in microorganisms (e.g., in 

25 vivo) as well as the biosynthetic pathway leading to the synthesis of HMBPA in vitro. 

The term "HMBPA biosynthetic enzyme" includes any enzyme utilized 
in the formation of a compound (e.g., intermediate or product) of the HMBPA 
biosynthetic pathway. For example, synthesis of 2-hydroxyiso valeric acid (a-HIV) from 
a-ketoisovalerate (a-KTV) is catalyzed by the panEl or panE2 gene product (PanEl, 

30 alternatively referred to herein ketopantoate reductase or PanE2, a a-ketoacid reductase 
that does not significantly contribute to pantothenate biosynthesis) and/or is catalyzed by 
the ilvC gene product (altematively referred to herein as acetohydroxyacid 
isomeroreductase). Formation of HMBPA from p-alanine and a-HIV is catalyzed by the 
panC gene product (altematively referred to herein as pantothenate synthetase). 
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The term "3-(2-hydroxy-3-methyl-butyrylamino)-propionic acid 
("HMBPA")" includes the free acid form of HMBPA, also referred to as "3-(2-hydroxy- 
3-methyl-butyrylamino)-propionate" as well as any salt thereof (e.g., derived by 
replacmg the acidic hydrogen of [R]-3-(2-hydroxy-3-methyl-butyrylamino)-propionic 
5 acid or [R]-3-(2-hydxoxy-3-methyl-butyrylamino)-propionate with a cation, for example, 
calcium, sodium, potassium, ammonium), also referred to as a "3-(2-hydroxy-3-methyl- 
butyrylamino)-propionic acid salt" or "HMBPA salt". Preferred HMBPA salts are 
calcium HMBPA or sodium HMBPA. HMBPA salts of the present invention include 
salts prepared via conventional methods from the free acids described herein. An 
10 HMBPA salt of the present invention can likewise be converted to a free acid form of 
[R]-3-(2-hydroxy-3-methyl-butyrylamino)-propionic acid or [R]-3-(2-hydroxy-3-methyl- 
butyrylamino)-propionate by conventional methodology. 

Various aspects of the invention are described in fiirther detail in the 
15 following subsections. 

/. Targeting Genes Encoding Various Pantothenate and/or 
Isoleucine-Valinefilv) and/or HMBPA Biosynthetic Enzymes 

In one embodiment, the present invention features targeting or modifying 

20 various biosynthetic enzymes of the pantothenate and/or isoleucine-valine(77v) and/or 
HMBPA biosynthetic pathways. In particular, the invention features modifying various 
enzymatic activities associated with said pathways by modifying or altering the genes 
encoding said biosynthetic enzymes. 

The term "gene", as used herein, includes a nucleic acid molecule (e.g., a 

25 DNA molecule or segment thereof that, in an organism, can be separated from another 
gene or other genes, by intergenic DNA (z. e. , intervening or spacer DNA which naturally 
flanks the gene and/or separates genes in the chromosomal DNA of the organism). 
Alternatively, a gene may slightly overiap another gene {e.g., the 3' end of a first gene 
overiapping the 5' end of a second gene), said overiapping genes separated from other 

30 genes by intergenic DNA. A gene may direct synthesis of an enzyme or other protein 
molecule {e.g., may comprise coding seqeimces, for example, a contiguous open reading 
frame (ORF) which encodes a protein) or may itself be functional in the organism. A 
gene in an organism, may be clustered in an operon, as defined herein, said operon being 
separated from other genes and/or operons by the intergenic DNA. An "isolated gene", 



-8- 



wo 02/057476 



PCT/US02/01887 



as used herein, includes a gene which is essentially free of sequences which naturally 
flank the gene in the chromosomal DNA of the organism from which the gene is derived 
(/.e., is free of adjacent coding sequences which encode a second or distinct protein, 
adjacent structural sequences or the like) and optionally includes 5' and 3' regulatory 
5 sequences, for example promoter sequences and/or terminator sequences. In one 
embodiment, an isolated gene includes predominantly coding sequences for a protein 
(e.g., sequences which encode Bacillus proteins). In another embodiment, an isolated 
gene includes coding sequences for a protein (e.g., for a Bacillus protein) and adjacent 5' 
and/or 3' regulatory sequences from the chromosomal DNA of the organism from which 

10 the gene is derived (e.g., adjacent 5' and/or 3' Bacillus regulatory sequences). 

Preferably, an isolated gene contains less than about 10 kb, 5 kb, 2 kb, 1 kb, 0.5 kb, 0.2 
kb, 0.1 kb, 50 bp, 25 bp or 10 bp of nucleotide sequences that naturally flank the gene in 
the chromosomal DNA of the organism from which the gene is derived. 

The term "operon" includes at least two adjacent genes or ORFs, 

15 optionally overlapping in sequence at either the 5' or 3' end of at least one gene or ORF. 
The term "operon" includes a coordinated unit of gene expression that contains a 
promoter and possibly a regulatory element associated with one or more adjacent genes 
or ORFs (e.g., structural genes encoding enzymes, for example, biosynthetic enzymes). 
Expression of the genes (e.g., structural genes) can be coordinately regulated, for 

20 example, by regulatory proteins binding to the regulatory element or by anti- termination 
. of transcription. The genes of an operon (e.g. , structural genes) can be transcribed to 
give a single mRNA that encodes all of the proteins. 

A "gene having a mutation'* or "mutant gene" as used herein, includes a 
gene having a nucleotide sequence which includes at least one alteration (e.g., 

25 substitution, insertioni, deletion) such that the polypeptide or protein encoded by said 
mutant exhibits an activity that differs from the polypeptide or protein encoded by the 
wild-type nucleic acid molecule or gene. In one embodiment, a gene having a mutation 
or mutant gene encodes a polypeptide or protein having an increased activity as 
compared to the polypeptide or protein encoded by the wild-type gene, for example, 

30 when assayed under similar conditions (e.g. , assayed in microorganisms cultured at the 
same temperature). As used herein, an "increased activity" or "increased enzymatic 
activity" is one that is at least 5% greater than that of the polypeptide or protein encoded 
by the wild-type nucleic acid molecule or gene, preferably at least 5-10% greater, more 
preferably at least 10-25% greater and even more preferably at least 25-50%, 50-75% or 
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75-100% greater than that of the polypeptide or protein encoded by the wild-type nucleic 
acid molecule or gene. Ranges intermediate to the above-recited values, e.g., 75-85%, 
85-90%, 90-95%, are also intended to be encompassed by the present invention. As 
used herein, an "increased activity'' or "increased enzymatic activity" can also include 
5 an activity that is at least 1 .25-fold greater than the activity of the polypeptide or protein 
encoded by the wild-type gene, preferably at least 1 .5-fold greater, more preferably at 
least 2-fold greater and even more preferably at least 3-fold, 4-fold, 5-fold, 10-fold, 20- 
fold, 50-fold, 100-fold or greater than the activity of the polypeptide or protein encoded 
by the wild-type gene. 

10 In another embodiment, a gene having a mutation or mutant gene encodes 

a polypeptide or protein having a reduced activity as compared to the polypeptide or 
protein encoded by the wild-type gene, for example, when assayed under similar 
conditions {e.g., assayed in microorganisms cultured at the same temperature). A mutant 
gene also can encode no polypeptide or have a reduced level of production of the wild- ■ 

15 type polypeptide. As used herein, a "reduced activity" or "reduced enzymatic activity" 
is one that is at least 5% less than that of the polypeptide or protein encoded by the wild- 
type nucleic acid molecule or gene, preferably at least 5-10% less, more preferably at 
least 10-25% less and even more preferably at least 25-50%, 50-75% or 75-100% less 
than that of the polypeptide or protein encoded by the wild-type nucleic acid molecule or 

20 gene. Ranges intermediate to the above-recited values, e,g., 75-85%, 85-90%), 90-95%, 
are also intended to be encompassed by the present invention. As used herein, a 
"reduced activity" or "reduced enzymatic activity" can also include an activity that has 
been deleted or "knocked out" (e.g., approximately 100% less activity than that of the 
polypeptide or protein encoded by the wild-type nucleic acid molecule or gene). 

25 Activity can be determined according to any well accepted assay for 

measuring activity of a particular protein of interest. Activity can be measured or 
assayed directly, for example, measuring an activity of a protein isolated or purified 
from a cell or mocroorganism. Alternatively, an activity can be measured or assayed 
within a cell or mocroorganism or in an extracellular medium. For example, assaying 

30 for a mutant gene (Le. , said mutant encoding a reduced enzymatic activity) can be 

accomplished by expressing the mutated gene in a microorganism, for example, a mutant 
microorganism in which the enzyme is temperature-sensitive, and assaying the mutant 
gene for the ability to complement a temperature sensitive (Ts) mutant for enzymatic 
activity. A mutant gene that encodes an "increased enzymatic activity" can be one that 
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complements the Ts mutant more effectively than, for example, a corresponding wild- 
type gene. A mutant gene that encodes a "reduced enzymatic activity'' is one that 
complements the Ts mutant less effectively than, for example, a corresponding wild-type 
gene. 

5 It will be appreciated by the skilled artisan that even a single substitution 

in a nucleic acid or gene sequence (e.g., a base substitution that encodes an amino acid 
change in the corresponding amino acid sequence) can dramatically affect the activity of 
an encoded polypeptide or protein as compared to the corresponding wild-type 
polypeptide or protein. A mutant gene (e.g., encoding a mutant polypeptide or protein), 

10 as defined herein, is readily distinguishable from a nucleic acid or gene encoding a 
protein homologue in that a mutant gene encodes a protein or polypeptide having an 
altered activity, optionally observable as a different or distinct phenotype in a 
microorganism expressing said mutant gene or producing said mutant protein or 
polypeptide (i.e., a mutant microorganism) as compared to a corresponding 

15 microorganism expressing the wild-type gene. By contrast, a protein homologue has an 
identical or substantially similar activity, optionally phenotypically indiscemable when • 
produced in a microorganism, as compared to a corresponding microorganism 
expressing the wild-type gene. Accordingly it is not, for example, the degree of 
sequence identity between nucleic acid molecules, genes, protein or polypeptides that 

20 serves to distinguish between homologues and mutants, rather it is the activity of the 
encoded protein or polypeptide that distinguishes between homologues and mutants: 
homologues having, for example, low (e,g., 30-50% sequence identity) sequence identity 
yet having substantially equivalent functional activities, and mutants, for example 
sharing 99% sequence identity yet having dramatically different or altered functional 

25 activities. 

It will also be appreciated by the skilled artisan that nucleic acid 
molecules, genes, protein or polypeptides for use in the instant invention can be derived 
from any microorganisms having a HMBPA biosynthetic pathway, an ilv biosynthetic 
pathway or a pantothenate biosynthetic pathway. Such nucleic acid molecules, genes, 
30 protein or polypeptides can be identified by the skilled artisan using known techniques 
such as homology screening, sequence comparison and the like, and can be modified by 
the skilled artisan in such a way that expression or production of these nucleic acid 
molecules, genes, protein or polypeptides occurs in a recombinant microorganism (e.g., 
by using appropriate promoters, ribosomal binding sites, expression or integration 
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vectors, modifying the sequence of the genes such that the transcription is increased 
(taking into account the preferable codon usage), etc., according to techniques described 
herein and those known in the art). 

In one embodiment, the genes of the present invention are derived from a 
5 Gram positive microorganism organism {e.g, , a microorganism which retains basic dye, 
for example, crystal violet, due to the presence of a Gram-positive wall surrounding the 
microorganism). The term "derived from" (e.g. , "derived from'' a Gram positive 
microorganism) refers to a gene which is naturally found in the microorganism (e.g., is 
naturally found in a Gram positive microorganism). In a preferred embodiment, the 

10 genes of the present invention are derived from a microorganism belonging to a genus 
selected from the group consisting of Bacillus , Cornyebacterium (e.g., Cornyebacterium 
glutamicum), Lactobacillus, Lactococci and Streptomyces, In a more preferred 
embodiment, the genes of the present invention are derived from a microorganism is of 
the genus Bacillus. In another preferred embodiment, the genes of the present invention 

15 are derived from a microorganism selected from the group consisting of Bacillus 

subtilis, Bacillus lentimorbus. Bacillus lentus. Bacillus firmus. Bacillus pantothenticuSy 
Bacillus amyloliquefaciens, Bacillus cereus. Bacillus circulans. Bacillus coagulans. 
Bacillus licheniformiSy Bacillus megaterium. Bacillus pumilus. Bacillus thuringiensis. 
Bacillus halodurans, and other Group 1 Bacillus species, for example, as characterized 

20 by 16S rRNA type. In another preferred embodiment, the gene is derived from Bacillus 
brevis or Bacillus stearothermophilus. In another preferred embodinxent, the genes of 
the present invention are derived from a microorganism selected from the group 
consisting of Bacillus licheniformis. Bacillus amyloliquefaciens. Bacillus subtilis^ and 
Bacillus pumilus. In a particularly preferred embodiment, the gene is derived from 

25 Bacillus subtilis (e.g. , is Bacillus subtilis-deiivtd). The term "derived from Bacillus 
subtilis^' or ''Bacillus subtilis-dmyedi^ includes a gene which is naturally found in the 
microorganism Bacillus subtilis. Included within the scope of the present invention are 
Bacillus-denved genes (e.g., B. subtilis-denv^d genes), for example. Bacillus or B. 
subtilis cooAT genes, serA genes, glyA genes, coaA genes, pan genes and/or ilv genes. 

30 In another embodiment, the genes of the present invention are derived 

from a Gram negative (excludes basic dye) microorganism. In a preferred embodunent, 
the genes of the present invention are derived from a microorganism belonging to a 
genus selected from the group consisting of Salmonella (e.g.. Salmonella typhimurium), 
Escherichia, Klebsiella, Serratia, and Proteus. In a more preferred embodiment, the 
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genes of the present invention are derived from a microorganism of the genus 
Escherichia. In an even more preferred embodiment, the genes of the present invention 
are derived from Escherichia coli. In another embodiment, the genes of the present 
invention are derived from Saccharomyces (e.g., Saccharomyces cerevisiae). 

5 

II Recombinant Nucleic Acid Molecules and Vectors 
The present invention ftirther features recombinant nucleic acid 
molecules (e.g., recombinant DNA molecules) that include genes described herein (e.g., 
isolated genes), preferably Bacillus genes, more preferably Bacillus subtilis genes, even 

1 0 more preferably Bacillus subtilis pantothenate biosynthetic genes and/or isoleucine- 
valine (/7v) biosynthetic genes and/or HMBPA biosynthetic genes. The term 
"recombinant nucleic acid molecule'* includes a nucleic acid molecule (e.g., a DNA 
molecule) that has been altered, modified or engineered such that it differs in nucleotide 
sequence from the native or natural nucleic acid molecule from which the recombinant 

1 5 nucleic acid molecule was derived (e.g. , by addition, deletion or substitution of one or 
more nucleotides). Preferably, a recombinant nucleic acid molecule (e.g., a recombinant 
DNA molecule) includes an isolated gene of the present invention operably linked to 
regulatory sequences. The phrase "operably linked to regulatory sequence(s)" means 
that the nucleotide sequence of the gene of interest is linked to the regulatory 

20 sequence(s) in a manner which allows for expression (e.g. , enhanced, increased, 

constitutive, basal, attenuated, decreased or repressed expression) of the gene, preferably 
expression of a gene product encoded by the gene (e.g., when the recombinant nucleic 
acid molecule is included in a recombinant vector, as defined herein, and is introduced 
into a microorganism). 

25 The term "regulatory sequence" includes nucleic acid sequences which 

affect (e.g., modulate or regulate) expression of other nucleic acid sequences (/.e., 
genes). In one embodiment, a regulatory sequence is included in a recombinant nucleic 
acid molecule in a similar or identical position and/or orientation relative to a particular 
gene of interest as is observed for the regulatory sequence and gene of interest as it 

30 appears in nature, e.g. , in a native position and/or orientation. For example, a gene of 
interest can be included in a recombinant nucleic acid molecule operably linked to a 
regulatory sequence which accompanies or is adjacent to the gene of interest in the 
natural organism (e.g., operably linked to "native" regulatory sequences (e.g., to the 
"native" promoter). Alternatively, a gene of interest can be included in a recombinant 
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nucleic acid molecule operably linked to a regulatory sequence which accompanies or is 
adjacent to another (^.g., a different) gene in the natural organism. Alternatively, a gene 
of interest can be included in a recombinant nucleic acid molecule operably linked to a 
regulatory sequence from another organism. For example, regulatory sequences from 
5 other microbes {e.g., other bacterial regulatory sequences, bacteriophage regulatory 
sequences and the like) can be operably linked to a particular gene of interest. 

In one embodiment, a regulatory sequence is a non-native or non- 
naturally-occurring sequence {e.g., a sequence which has been modified, mutated, 
substituted, derivatized, deleted including sequences which are chemically synthesized). 

10 Preferred regulatory sequences include promoters, enhancers, termination signals, anti- 
termination signals and other expression control elements {e.g., sequences to which 
repressors or inducers bind and/or binding sites for transcriptional and/or translational 
regulatory proteins, for example, in the transcribed mRNA). Such regulatory sequences 
are described, for example, in Sambrook, J., Fritsh, E. F., and Maniatis, T. Molecular 

15 ' Cloning: A Laboratory Manual 2nd, ed, Cold Spring Harbor Laboratory, Cold Spring 
Harbor Laboratory Press, Cold Spring Harbor, NY, 1989. Regulatory sequences include 
those which direct constitutive expression of a nucleotide sequence in a microorganism 
{e.g., constitutive promoters and strong constitutive promoters), those which direct 
inducible expression of a nucleotide sequence in a microorganism {e.g., inducible 

20 promoters, for example, xylose inducible promoters) and those which attenuate or 

. repress expression of a nucleotide sequence in a microorganism {e.g, attenuation signals 
or repressor sequences). It is also v^thin the scope of the present invention to regulate 
expression of a gene of interest by removing or deleting regulatory sequences. For 
example, sequences involved in the negative regulation of transcription can be removed 

25 such that expression of a gene of interest is enhanced. 

In one embodiment, a recombinant nucleic acid molecule of the present 
invention includes a nucleic acid sequence or gene that encode at least one bacterial gene 
product {e.g., a pantothenate biosynthetic enzyme, an isoleucine-valine biosynthetic 
enzyme and/or a HMBPA biosynthetic enzyme) operably linked to a promoter or 

30 promoter sequence. Preferred promoters of the present invention include Bacillus 
promoters and/or bacteriophage promoters {e.g., bacteriophage which infect Bacillus). 
In one embodiment, a promoter is a Bacillus promoter, preferably a strong Bacillus 
promoter {e.g., a promoter associated v^th a biochemical housekeeping gene in Bacillus 
or a promoter associated with a glycolytic pathway gene in Bacillus). In another 
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embodiment, a promoter is a bacteriophage promoter. In a preferred embodiment, the 
promoter is from the bacteriophage SPOl . In a particularly preferred embodiment, a 
promoter is selected from the group consisting of P26 Pveg> having for example, 
the following respective seqeunces: 
5 GCTATTGACGACAGCTATGGTTCACTGTCCACCAACCAAAACTGTGCTCAGT 
ACCGCCAATATTTCTCCCTTGAGGGGTACAAAGAGGTGTCCCTAGAAGAGAT 
CCACGCTGTGTAAAAATTTTACAAAAAGGTATTGACTTTCCCTACAG 
GTAATAATTTAATTACAGGCGGGGGCAACCCCGCCTGT(SEQ ID NOrl), 
GCCTACCTAGCTTCCAAGAAAGATATCCTAACAGCACAAGAGCGGAAAGAT 
1 0 GTTTTGTTCTACATCCAGAACAACCTCTGCTAAAATTCCTGAAAAATTTTGCA 
AAAAGTTGTTGACTTTATCTACAAGGTGTGGTATAATAATCTTAACAACAGC 
AGGACGC (SEQ ID N0:2), and 

GAGGAATCATAGAATTTTGTCAAAATAATTTTATTGACAA 
GTTGATATAATTTAAATTTTATTTGACAAAAATGG • 

1 5 AATGTAGTGAGGTGGATGCAATG (SEQ ID N0:3). Additional preferred promoters 
include tef (the translational elongation factor (TEF) promoter) and pyc (the pyruvate 
carboxylase (PYC) promoter), v/hich promote high level expression in Bacillus {e.g., 
Bacillus suhtilis). Additional preferred promoters, for example, for use in Gram positive 
microorganisms include, but are not limited to, amy and SP02 promoters. Additional 

20 preferred promoters, for example, for use in Gram negative microorganisms include, but 
are not hmited to, cos, tac, trp, tet, trp-tet, Ipp, lac, Ipp-lac, lacIQ, T7, T5, T3, gal, trc, 
ara, SP6, X-PRor X-PL. 

In another embodiment, a recombinant nucleic acid molecule of the 
present invention includes a terminator sequence or terminator sequences {e.g., 

25 transcription terminator sequences). The term "terminator sequences" includes 
regulatory sequences that serve to terminate transcription of mRNA. Temiinator 
sequences (or tandem transcription terminators) can ftirther serve to stabilize mRNA 
{^'gy by adding structure to mRNA), for example, against nucleases. 

In yet another embodiment, a recombinant nucleic acid molecule of the 

30 present invention includes sequences which allow for detection of the vector containing 
said sequences {i.e., detectable and/or selectable markers), for example, genes that 
encode antibiotic resistance or sequences that overcome auxotrophic mutations, for 
example, trpC, fluorescent markers, drug markers, and/or colorimetric markers (e.g., 
/acZ/p-galactosidase). In yet another embodiment, a recombinant nucleic acid molecule 
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of the present invention includes an artificial ribosome binding site (RBS) or a sequence 
that becomes transcribed into an artificial RBS. The term "artificial ribosome binding 
site (RBS)" includes a site within an mRNA molecule (e.g., coded within DNA) to 
which a ribosome binds (e.g. , to initiate translation) which differs from a native RBS 
5 (e.g., a RBS found in a naturally-occurring gene) by at least one nucleotide. Preferred 
artificial RBSs include about 5-6, 7-8, 9-10, 11-12, 13-14, 15-16, 17-18, 19-20, 21-22, 
23-24, 25-26, 27-28, 29-30 or more nucleotides of which about 1-2, 3-4, 5-6, 7-8, 9-10, 
1 1-12, 13-15 or more differ from the native RBS (e.g., the native RBS of a gene of 
interest, for example, the native panB RBS TAAACATGAGGAGGAGAAAACATG 

10 (SEQ ID N0:4) or the native panD RBS 

ATTCGAGAAATGGAGAGAATATAATATG (SEQ ID N0:5)). 

Preferably, nucleotides that differ are substituted such that they are 
identical to one or more nucleotides of an ideal RBS when optimally aligned for 
comparisons. Ideal RBSs include, but are not limited to, AGAAAGGAGGTGA (SEQ 

1 5 ID N0:6), TTAAGAAAGGAGGTGANNNNATG (SEQ ID N0:7), 
TTAGAAAGGAGGTGANNNNNATG (SEQ ID N0:8), 
AGAAAGGAGGTGANNNNNNNATG (SEQ ID N0:9), and 

AGAAAGGAGGTGANNNNNNATG (SEQ ID NO: 1 0). Artificial RBSs can be used to 
replace the naturally-occurring or native RBSs associated with a particular gene. 

20 Artificial RBSs preferably increase translation of a particular gene. Preferred artificial 
RBSs (e.g., RBSs for increasing the translation of panB, for example, of B. subtilis 
panB) include CCCTCTAGAAGGAGGAGAAAACATG (SEQ ID NO: 11) and 
CCCTCTAGAGGAGGAGAAAACATG (SEQ ID NO: 12). Preferred artificial RBSs 
(e.g., RBSs for increasing the translation of panD, for example, of B. subtilis panU) 

25 include TTAGAAAGGAGGATTTAAATATG (SEQ ID NO: 1 3), 
TTAGAAAGGAGGTTTAATTAATG (SEQ ID NO: 14), 
TTAGAAAGGAGGTGATTTAAATG (SEQ ID NO: 15), 

TTAGAAAGGAGGTGTTTAAAATG (SEQ ID NO: 16), ATTCGAGAAAGGAGG 
TGAATATAATATG (SEQ ID NO: 17), ATTCGAGAAAGGAGGTGAATAATAATG 
30 (SEQ ID NO: 1 8), and ATTCGTAGAAAGGAGGTGAATTAATATG (SEQ ID NO: 1 9). 

The present invention fiirther features vectors (e.g., recombinant vectors) 
that include nucleic acid molecules (e.g., genes or recombinant nucleic acid molecules 
comprising said genes) as described herein. The term "recombinant vector" includes a 
vector (e.g., plasmid, phage, phasmid, vuiis, cosmid or other purified nucleic acid 
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vector) that has been altered, modified or engineered such that it contains greater, fewer 
or different nucleic acid sequences than those included in the native or natural nucleic 
acid molecule from which the recombinant vector was derived. Preferably, the 
recombinant vector includes a biosynythetic enzyme-encoding gene or recombinant 
5 nucleic acid molecule including said gene, operably linked to regulatory sequences, for 
example, promoter sequences, terminator sequences and/or artificial ribosome binding 
sites (RBSs), as defined herein. In another embodiment, a recombinant vector of the 
present invention includes sequences that enhance replication in bacteria (e.g., 
replication-enhancing sequences). In one embodiment, replication-enhancing sequences 

10 are derived from E, coli. In another embodiment, replication-enhancing sequences are 
derived fi"om pBR322. 

In yet another embodiment, a recombinant vector of the present invention 
includes antibiotic resistance sequences. The term "antibiotic resistance sequences" 
includes sequences which promote or confer resistance to antibiotics on the host 

15 organism (e.g.. Bacillus). In one embodiment, the antibiotic resistance sequences are 
selected fi-om the group consisting of cat (chloramphenicol resistance) sequences, tet 
(tetracycline resistance) sequences, erm (erythromycin resistance) sequences, neo 
(neomycin resistance) sequences, kan (kanamycin resistance) and spec (spectinomycin 
resistance) sequences. Recombinant vectors of the present invention can further include 

20 homologous recombination sequences (e.g., sequences designed to allow recombination 
of the gene of interest into the chromosome of the host organism). For example, bpr, 
vpr^ and/or amyE sequences can be used as homology targets for recombination into the 
host chromosome. It will further be appreciated by one of skill in the art that the design 
of a vector can be tailored depending on such factors as the choice of microorganism to 

25 be genetically engineered, the level of expression of gene product desired and the like. 



TV. Recombinant Microorganisms 

The present invention further features microorganisms, i.e., recombinant 
30 microorganisms, that include vectors or genes (e.g., wild-type and/or mutated genes) as 
described herein. As used herein, the term "recombinant microorganism" includes a 
microorganism (e.g., bacteria, yeast cell, fungal cell, etc.) that has been genetically 
altered, modified or engineered (e.g., genetically engineered) such that it exhibits an 
altered, modified or different genotype and/or phenotype (e.g., when the genetic 
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modification affects coding nucleic acid sequences of the microorganism) as compared 
to the naturally-occurring microorganism from which it was derived. 

In one embodiment, a recombinant microorganism of the present 
invention is a Gram positive organism {e.g,, a microorganism which retains basic dye, 
5 for example, crystal violet, due to the presence of a Gram-positive wall surrounding the 
microorganism). In a preferred embodiment, the recombinant microorganism is a 
microorganism belonging to a genus selected from the group consisting of Bacillus, 
Cornyebacterhim, Lactobacillus , Lactococci and Streptomyces. In a more preferred 
embodiment, the recombinant microorganism is of the genus Bacillus, In another . 

10 preferred embodiment, the recombinant microorganism is selected from the group 
consisting of Bacillus subtilis, Bacillus lentimorbus. Bacillus lentus^ Bacillus firmus. 
Bacillus pantothenticus. Bacillus amyloliqiiefaciens. Bacillus cereus. Bacillus circularise 
Bacillus coagulans. Bacillus licheniformis. Bacillus inegaterium. Bacillus pumilus. 
Bacillus thuringiensis. Bacillus halodurans^ and other Group 1 Bacillus species, for 

15 example, as characterized by 16S rRNA type. In another preferred embodiment, the 
recombinant microorganism is Bacillus brevis or Bacillus stearothermophilus. In 
another preferred embodiment, the recombinant microorganism is selected from the 
group consisting of Bacillus licheniformis. Bacillus amyloliquefaciens, > Bacillus subtilis, 
and Bacillus pumilus. 

20 In another embodiment, the recombinant microorganism is a Gram 

negative (excludes basic dye) organism. In a preferred embodiment, the recombinant ; 
microorganism is a microorganism belonging to a genus selected from the group 
consisting of Salmonella, Escherichia, Klebsiella, Serratia, and Proteus. In a more 
preferred embodiment, the recombinant microorganism is of the genus Escherichia, In 

25 an even more preferred embodiment, the recombinant microorganism is Escherichia 
coli. In another embodiment, the recombinant microorganism is Saccharomyces (e.g., & 
cerevisiae), 

A preferred "recombinant" microorganism of the present invention is a 
microorganism having a deregulated pantothenate biosynthesis pathway or enzyme, a 
30 deregulated isoleucine-valine (zZv) biosynthetic pathway or enzyme and/or a deregulated 
HMBPA biosynthetic pathway or enzyme. The term "deregulated" or "deregulation" 
includes the alteration or modification of at least one gene in a microorganism that 
encodes an enzyme in a biosynthetic pathway, such that the level or activity of the 
biosynthetic enzyme m the microorganism is altered or modified. Preferably, at least 
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one gene that encodes an enzyme in a biosynthetic pathway is altered or modified such 
that the gene product is enhanced or increased. The phrase "deregulated pathway" can 
also include a biosynthetic pathway in which more than one gene that encodes an 
enzyme in a biosynthetic pathway is ahered or modified such that the level or activity of 
5 more than one biosynthetic enzyme is altered or modified. The ability to "deregulate" a 
pathway (e.g., to simultaneously deregulate more than one gene in a given biosynthetic 
pathway) in a microorganism in some cases arises fi*om the particular phenomenon of 
microorganisms in which more than one enzyme (e.g., two or three biosynthetic 
enzymes) are encoded by genes occurring adjacent to one another on a contiguous piece 

10 of genetic material termed an "operon" (defined herein). Due to the coordinated 
regulation of genes included in an operon, alteration or modification of the single 
promoter and/or regulatory element can result in alteration or modification of the 
expression of each gene product encoded by the operon. Alteration or modification of 
the regulatory element can include, but is not limited to removing the endogenous 

15 promoter and/or regulatory element(s), adding strong promoters, inducible promoters or 
multiple promoters or removing regulatory sequences such that expression of the gene 
products is modified, modifying the chromosomal location of the operon, altering 
nucleic acid sequences adjacent to the operon or within the operon such as a ribosome 
binding site, increasing die copy number of the operon, modifymg proteins (e.g., 

20 regulatory proteins, suppressors, enhancers, transcriptional activators and the like) 
involved in transcription of the operon and/or translation of the gene products of the 
operon, or any other conventional means of deregulating expression of genes routine in 
the art (including but not Hmited to use of antisense nucleic acid molecules, for example, 
to block expression of repressor proteins). Deregulation can also involve altering the 

25 coding region of one or more genes to yield, for example, an enzyme that is feedback 
resistant or has a higher or lower specific activity. 

In another preferred embodiment, a recombinant microorganism is 
designed or engineered such that at least one pantothenate biosynthetic enzyme, at least 
one isoleucine-valine biosynthetic enzyme, and/or at least one HMBPA biosynthetic 

30 enzyme is overexpressed. The term "overexpressed" or "overexpression" includes 
expression of a gene product (e.g., a biosynthetic enzyme) at a level greater than that 
expressed prior to manipulation of the microorganism or in a comparable microorganism 
which has not been manipulated. In one embodiment, the microorganism can be 
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genetically designed or engineered to overexpress a level of gene product greater than 
that expressed in a comparable microorganism which has not been engineered. 

Genetic engineering can include, but is not limited to, altering or 
modifying regulatory sequences or sites associated with expression of a particular gene 
5 (e.g. , by adding strong promoters, inducible promoters or multiple promoters or by 
removing regulatory sequences such that expression is constitutive), modifying the 
chromosomal location of a particular gene, altering nucleic acid sequences adjacent to a 
particular gene such as a ribosome binding site, increasing the copy number of a 
particular gene, modifying proteins (e.g., regulatory proteins, suppressors, enhancers, 

10 transcriptional activators and the like) involved in transcription of a particular gene 
and/or translation of a particular gene product, or any other conventional means of 
deregulating expression of a particular gene routine in the art (including but not limited 
to use of antisense nucleic acid molecules, for example, to block expression of repressor 
proteins). Genetic engineering can also include deletion of a gene, for example, to block 

15 a pathway or to remove a repressor. In embodiments featuring microorganisms having 
deleted genes, the skilled artisan will appreciate that at least low levels of certain 
compovmds may be required to be present in or added to the culture medium in order that 
the viability of the microorganism is not compromised. Often, such low levels are 
present in complex culture media as routinely formulated. Moreover, in processes 

20 featuring culturing microorganisms having deleted genes cultured under conditions such 
that commercially or industrially attractive quantities of product are produced, it may be 
necessary to supplement culture media with slightly increased levels of certain 
compovmds. For example, in processes featuring culturing a microorganism having a 
deleted panB gene, at least low levels of pantothenate must be present in the media, e.g.^ 

25 levels such as those found in routinely formulated complex media, whereas slightly 
increased levels of pantothenate may be added to the media in order to produce 
commercially or industrially attractive amounts of, for example, HMBPA. For example, 
10-30 mg/L pantothenate can be added to the media in order to produce commercially or 
industrially attractive amounts of HMBPA. 

30 In another embodiment, the microorganism can be physically or 

environmentally manipulated to overexpress a level of gene product greater than that 
expressed prior to manipulation of the microorganism or in a comparable microorganism 
which has not been manipulated. For example, a microorganism can be treated with or 
cultured in the presence of an agent known or suspected to increase transcription of a 



-20- 



wo 02/057476 



PCTAJS02/01887 



particular gene and/or translation of a particular gene product such that transcription 
and/or translation are enhanced or increased. Alternatively, a microorganism can be 
cultured at a temperature selected to increase transcription of a particular gene and/or 
translation of a particular gene product such that transcription and/or translation are 
5 enhanced or increased. 

K Culturing and Fermenting Recombinant Microorganisms 
The term "culturing" includes maintaining and/or growing a living 
i microorganism of the present invention {e.g, , maintaining and/or growing a culture or 

10 strain). In one embodiment, a microorganism of the invention is cultured in liquid 
media. In another embodiment, a microorganism of the invention is cultured in solid 
media or semi-solid media. In a preferred embodiment, a microorganism of the 
invention is cultured in media {e.g., a sterile, liquid media) comprising nutrients essential 
or beneficial to the maintenance and/or growth of the microorganism {e.g., carbon 

15 sources or carbon substrate, for example carbohydrate, hydrocarbons, oils, fats, fatty 
acids, organic acids, and alcohols; nitrogen sources, for example, peptone, yeast extracts, 
meat extracts, malt extracts, urea, ammonium sulfate, ammonium chloride, ammonium 
nitrate and auMnonium phosphate; phosphorus sources, for example, phosphoric acid, . 
sodium and potassium salts thereof; trace elements, for example, magnesium, iron, 

20 manganese, calcium, copper, zinc, boron, molybdenum, and/or cobalt salts; as well as > 
growth factors such as amino acids, vitamins, growtli promoters and the like). ; 

Preferably, microorganisms of the present invention are cultured under 
controlled pH. The term "controlled pH" includes any pH which results in production of 
the desired product {e.g., HMBPA). In one embodiment microorganisms are cultured at 

25 a pH of about 7. In another embodiment, microorganisms are cultured at a pH of 
between 6.0 and 8.5. The desired pH may be maintained by any number of methods 
known to those skilled in the art. 

Also preferably, microorganisms of the present invention are cultured 
under controlled aeration. The term "controlled aeration" includes sufficient aeration 

30 {e,g. , oxygen) to result in production of the desired product {e.g. , HMBPA). In one 
embodiment, aeration is controlled by regulating oxygen levels in the culture, for 
example, by regulating the amount of oxygen dissolved in culture media. Preferably, 
aeration of the culture is controlled by agitating the culture. Agitation may be provided 
by a propeller or similar mechanical agitation equipment, by revolving or shaking the 
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cuture vessel (e.g., tube or flask) or by various pumping equipment. Aeration may be 
further controlled by the passage of sterile air or oxygen through the medium (e.g. , 
through the fermentation mixture). Also preferably, microorganisms of the present 
invention are cultured without excess foaming (e.g., via addition of antifoaming agents). 
5 Moreover, microorganisms of the present invention can be cultured under 

controlled temperatures. The term "controlled temperature" includes any temperature 
which results in production of the desired product (e.g., HMBPA). In one embodiment, 
controlled temperatures include temperatures between 15*^C and 95°C. In another 
embodiment, controlled temperatures include temperatures between 15°C and 70°C. 
10 Preferred temperatures are between and 55°C, more preferably between 30°C and 
50°C. 

Microorganisms can be cultured (e.g., maintained and/or grown) in liquid 
media and preferably are cultured, either continuously or intermittently, by conventional 
culturing methods such as standing culture, test tube culture, shaking culture (e.g., rotary 

15 shaking culture, shake flask culture, etc.), aeration spinner culture, or fermentation. In a 
preferred embodiment, the microorganisms are cultured in shake flasks. In a more 
preferred embodiment, the microorganisms are cultured in a fermentor (e.g., a 
fermentation process). Fermentation processes of the present invention include, but are 
not limited to, batch, fed-batch and continuous processes or methods of fermentation. 

20 The phrase "batch process" or "batch fermentation" refers to a closed system in which 
the composition of media, nutrients, supplemental additives and the like is set at the 
beginning of the fermentation and not subject to alteration during the fermentation, 
however, attempts may be made to control such factors as pH and oxygen concentration 
to prevent excess media acidification and/or microorganism death. The phrase "fed- 

25 batch process" or "fed-batch" fermentation refers to a batch fermentation with the 
exception that one or more substrates or supplements are added (e.g., added in 
increments or continuously) as the feraientation progresses. The phrase "continuous 
process" or "continuous fermentation" refers to a system in which a defined fermentation 
media is added continuoxisly to a fermentor and an equal amount of used or 

30 "conditioned" media is simultaneously removed, preferably for recovery of the desired 
product (e.g. , HMBPA). A variety of such processes have been developed and are well- 
known in the art. 
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The phrase "culturing under conditions such that a desired compound is 
produced" includes maintaining and/or growing microorganisms under conditions (e.g., 
temperature, pressure, pH, duration, etc.) appropriate or sufficient to obtain production 
of the desired compound or to obtain desired yields of the particular compound being 

5 produced. For example, culturing is continued for a time sufficient to produce the 
desired amount of a compound' (e.g., HMBPA). Preferably, culturing is continued for a 
time sufficient to substantially reach suitable production of the compound (e.g., a time 
sufficient to reach a suitable concentration of HMBPA or suitable ratio of 
HMBPA:pantothenate). In one embodiment, culturing is continued for about 12 to 24 

10 hours. In another embodiment, culturing is continued for about 24 to 36 hours, 36 to 48 
hours, 48 to 72 hours, 72 to 96 hours, 96 to 120 hours, 120 to 144 hours, or greater than 
144 hours. In yet another embodiment, microorganisms are cultured under conditions 
such that at least about 5 to 10 g/L of compound are produced in about 36 hours, at least 
about 10 to 20 g/L compoimd are produced in about 48 hours, or at least about 20 to 30 

1 5 g/L compound in about 72 hours. In yet another embodiment, microorganisms are 

cultured under conditions such that at least a ratio of HMBPA:HMBPA+pantothenate of 
1 : 1 0 is achieved (L e, , 1 0% HMBPA versus 90% pantothenate, for example, as 
determined by comparing the peaks when a sample of product is analyzed be HPLC), • 
preferably such that at least a ratio of 2: 10 is achieved (20% HMBPA versus 90% ' 

20 pantotheante), more preferably such that a ratio of at least 2.5:10 is achieved (25% 
■ HMBPA versus 75% pantotheante), more preferably at least 3:10 (30% HMBPA versus 
70% pantotheante), 4:10 (40% HMBPA versus 60% pantotheante), 5:10 (50% HMBPA 
versus 50% pantotheante), 6:10 (60% HMBPA versus 40% pantotheante), 7:10 (70% 
HMBPA versus 30% pantotheante), 8:10 (80% HMBPA versus 20% pantotheante), 9:10 

25 (90% HMBPA versus 1 0% pantotheante) or greater. 

The methodology of the present invention can further include a step of 
recovering a desired compound (e.g., HMBPA). The term "recovering" a desired 
compound includes extracting, harvesting, isolating or purifying the compound from 
culture media. Recovering the compound can be performed according to any 

30 conventional isolation or purification methodology known in the art including, but not 
limited to, treatment with a conventional resin (e.g., anion or cation exchange resin, non- 
ionic adsorption resin, etc.), treatment with a conventional adsorbent (e.g., activated 
charcoal, silicic acid, siUca gel, cellulose, alumina, etc.), alteration of pH, solvent 
extraction (e.g., with a conventional solvent such as an alcohol, ethyl acetate, hexane and 
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the like), dialysis, filtration, concentration, crystallization, recrystallization, pH 
adjustment, lyophilization and the like. For example, a compound can be recovered 
from culture media by first removing the microorganisms from the culture. Media are 
then passed through or over a cation exchange resin to remove cations and then through 
5 or over an anion exchange resin to remove inorganic anions and organic acids having 
stronger acidities than the compound of interest. The resulting compound can 
subsequently be converted to a salt (e.g., a calcium salt) as described herein. 

Preferably, a desired compound of the present invention is "extracted", 
"isolated" or "purified" such that the resultmg preparation is substantially free of other 

10 media components (e.g., free of media components and/or fermentation byproducts). 
The language "substantially free of other media components" includes preparations of 
the desired compoimd in which the compound is separated from media components or 
fermentation byproducts of the culture from which it is produced. In one embodiment, 
the preparation has greater than about 80% (by dry weight) of the desired compound 

1 5 (e.g. , less than about 20% of other media components or fermentation byproducts), more 
preferably greater than about 90% of the desired compound (e.g., less than about 10% of 
other media components or fermentation byproducts), still more preferably greater than - 
about 95% of the desired compound (e.g., less than about 5% of other media components - 
or fermentation byproducts), and most preferably greater than about 98-99% desired • . 

20 compound (e.g., less than about 1-2% other media components or fermentation 

byproducts). When the desired compound has been derivatized to a salt, the compound 
is preferably further free of chemical contaminants associated with the formation of the 
salt. When the desired compound has been "derivatized to an alcohol, the compound is 
preferably further free of chemical contaminants associated with the formation of the 

25 alcohol. 

In an alternative embodiment, the desired compound is not purified from 
the microorganism, for example, when the microorganism is biologically non-hazardous 
(e.g., safe). For example, the entire culture (or culture supernatant) can be used as a 
source of product (e.g., crude product). In one embodiment, the culture (or culture 
30 supernatant) is used vsdthout modification. In another embodiment, the culture (or 

culture supernatant) is concentrated. In yet another embodiment, the culture (or culture 
supernatant) is dried or lyophilized. 
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Preferably, a production method of the present invention results in 
production of the desired compound at a significantly high yield. The phrase 
"significantly high yield" includes a level of production or yield which is sufficiently 
elevated or above what is usual for comparable production methods, for example, which 

5 is elevated to a level sufficient for commercial production of the desired product {e.g., 
production of the product at a commercially feasible cost). In one embodiment, the 
invention features a production method that includes culturing a recombinant 
microorganism under conditions such that the desired product (e.g., HMBPA) is 
produced at a level greater than 2 g/L. In another embodiment, the invention features a 

10 production method that includes culturing a recombinant microorganism imder 

conditions such that the desired product {e.g., HMBPA) is produced at a level greater 
than 10 g/L. In another embodiment, the invention features a production method that 
includes culturing a recombinant microorganism under conditions such that the desired 
product {e.g. , HMBPA) is produced at a level greater than 20 g/L. In yet another 

15 embodiment, the invention features a production method that includes culturing a 
recombinant microorganism under conditions such that the desired product {e.g., 
HMBPA) is produced at a level greater than 30 g/L. In yet another embodiment, the 
invention features a production method that includes culturing a recombinant 
microorganism under conditions such that the desired product {e.g., HMBPA) is 

20 produced at a level greater than 40 g/L. The invention further features a production 
method for producing the desired compound that involves culturing a recombinant 
microorganism imder conditions such that a sufficiently elevated level of compound is 
produced within a commercially desirable period of time. 

Depending on the biosynthetic enzyme or combination of biosynthetic 

25 enzymes manipulated, it may be desirable or necessary to provide {e.g. , feed) 

microorganisms of the present invention at least one biosynthetic precursor such that the 
desired compound or compoimds are produced. The term "biosynthetic precursor" or 
"precursor" includes an agent or compoimd which, when provided to, brought into 
contact with, or included in the culture medium of a microorganism, serves to enhance 

30 or increase biosynthesis of the desired product. In one embodiment, the biosynthetic 
precursor or precursor is aspartate. In another embodiment, the biosynthetic precursor or 
precursor is |3-alanine. The amount of aspartate or p-alanine added is preferably an 
amount that results in a concentration in the culture medium sufficient to enhance 
productivity of the microorganism {e.g., a concentration sufficient to enhance production 
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of HMBPA. The term "excess |3-alanine" includes p-alanine levels increased or higher 
that those routinely utilized for culturing the microorganism in question. For example, 
culturing the Bacillus microorganisms described in the instant Examples is routinely 
done in the presence of about 0-5 g/L P-alanine. Accordingly, excess P-alanine levels 
5 can include levels of about 5-10 g/L or more preferably about 5-20 g/L P-alanine. 
Biosynthetic precursors of the present invention can be added in the form of a 
concentrated solution or suspension {e.g., in a suitable solvent such as water or buffer) or 
in the form of a solid {e.g., in the form of a powder). Moreover, biosynthetic precursors 
of the present invention can be added as a single aliquot, continuously or intermittently 

1 0 over a given period of time. 

In yet another embodiment, the biosynthetic precursor is valine. In yet 
another embodiment, the biosynthetic precursor is a-ketoisovalerate. Preferably, valine 
or a-ketoisovalerate is added in an amoxint that results in a concentration in the medium 
sufficient for production of the desired product {e.g., HMBPA) to occur. The term 

15 "excess a-KIV" includes a-KIV levels increased or higher that those routinely utilized 
for culturing the microorganism in question. For example, culturing the Bacillus 
microorganisms described in the instant Examples can be done in the presence of about > 
0-5 g/L a-KIV. Accordingly, excess a-KIV levels can include levels of about 5-10 g/L, • 
and more preferably about 5-20 g/L. The term "excess valine" includes valine levels 

20 increased or higher that those routinely utilized for culturing the microorganism in . 

question. For example, culturing the Bacillus microorganisms described in the instant ' 
Examples is routinely done in the presence of about 0-0.5 g/L valine. Accordingly, 
excess valine levels can include levels of about 0.5-5 g/L, preferably about 5-20 g/L 
valine. Biosynthetic precursors are also referred to herein as "supplemental biosynthetic 

25 substrates". 

Moreover, certain aspects of the present invention include culturing 
microorganisms {e.g., recombinant microorganisms) under conditions of increased 
steady state glucose, decreased steady state dissolved oxygen and/or decreased serine. 
The term "increased steady state glucose" includes steady state glucose levels increased 
30 or higher that those routinely utilized for culturing the microorganism in question. For 
example, culturing the Bacillus microorganisms described in the instant Examples is 
routinely done in the presence of about 0.2-1.0 g/L steady state glucose. Accordingly, 
increased steady state glucose levels can include levels of about 1-2 g/1, about 2-5 g/1, 
and preferably about 5-20 g/L steady state glucose. The term "decreased steady state 
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dissolved oxygen" includes steady state dissolved oxygen levels less or lower that those 
routinely utilized for culturing the microorganism in question and, for example, 
inversely correlates with increased steady state glucose levels. For example, culturing 
the Bacillus microorganisms described in the instant Examples is routinely done in the 

5 presence of about 10-30% dissolved oxygen. Accordingly, decreased steady state 
dissolved oxygen can include levels of about 0-10%, and preferably about 0-5% steady 
state dissolved oxygen. The term "reduced serine" includes serine levels within the 
lower range of those routinely utilized for culturing the microorganism in question. For 
example, culturing the Bacillus microorganisms described in the instant Examples is 

10 routinely done in the presence of about 0-0.5 g/L serine. Accordingly, reduced serine 
levels can include, for example, levels of 0-0.1 g/L serine. 

Another aspect of the present invention includes biotransformation 
processes which feature the recombinant microorganisms described herein. The term 
"biotransformation process", also referred to herein as "bioconversion processes", 

1 5 includes biological processes which results in the production (e.g. , transformation or 
conversion) of appropriate substrates and/or intermediate compounds into a desired 
product (e.g., HMBPA). 

In one embodiment, the invention features a biotransformation process 
for the production of HMBPA comprising contacting a microorganism which 

20 overexpresses a reductase {e,g, , overexpresses PanEl , PanE2 and/or II vC) with 

appropriate substrates or precursors under conditions such that HMBPA is produced and 
recovering said HMBPA. In another embodiment, the invention features a 
biotransformation process for the production of HMBPA comprising contacting a 
microorganism which has a reduced or deleted PanB activity vsdth appropriate substrates 

25 or precursors under conditions such that HMBPA is produced and recovering said 

HMBPA. In yet another embodiment, the invention features a biotransformation process 
for the production of HMBPA comprising contacting a microorganism which 
overexpresses at least one reductase and has a reduced or deleted PanB activity with 
appropriate substrates or precursors under conditions such that HMBPA is produced and 

30 recovering said HMBPA. Preferred recombinant microorganisms for carrying out the 
above-described biotransformations include the recombinant microorganisms described 
herein. In yet another embodiment, the invention features a biotransformation reaction 
that includes contacting afflV and p-alanine with isolated or purified PanC under 
conditions such that HMBPA is produced. a-HIV can optionally be obtained by 
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contacting a-KIV with purified or isolated reductase (e.g., PanEl, PanE2 and/or IlvC) 
and a source of reducing equivalent, for example, NADH. Conditions under which a- 
HIV or HMBPA are produced can include any conditions which result in the desired 
production of a-HIV or HMBPA, respectively. In yet another embodiment, the present 

5 invention includes a method of producing a-HIV that includes culturing a 

microorganism that overexpresses PanEl and/or PanE2, and/or IlvC and has a reduced 
or deleted PanC or PanD (to reduce HMBPA or p-alanine sunthesis, respectively) under 
conditions such that a-HIV is produced. 

The microorganism(s) and/or enzymes used in the biotransformation 

1 0 reactions are in a form allowing them to perform their intended function (e.g. , producing 
a desired compound). The microorganisms can be whole cells, or can be only those 
portions of the cells necessary to obtain the desired end result. The microorganisms can 
be suspended (e.g. , in an appropriate solution such as buffered solutions or media), 
rinsed (e.g., rinsed fi-ee of media fi-om culturing the microorganism), acetone-dried, 

1 5 immobilized (e.g. , with polyacrylamide gel or k-carrageenan or on synthetic supports, 
for example, beads, matrices and the like), fixed, cross-linked or permeabUzed (e.g., 
have permeablized membranes and/or walls such that compounds, for example, 
substrates, intermediates or products can more easily pass through said membrane or 
wall). 

20 

This invention is further illustrated by the following examples which 
should not be construed as limiting. The contents of all references, patents and 
. published patent applications cited throughout this application are incorporated herein by 
25 reference. 
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EXAMPLES 

Example I: Discovery and Characterization of the [R]-3-(2-hydroxy-3-methyl- 
butyrylamino)-propionic acid (HMBPA) Biosynthetic Pathway 

5 In developing Bacillus strains for the production of pantothenate, various 

genetic manipulations were made to enzymes involved in the pantothenate biosynthetic 
pathway and the isoleucine-valine (z7v) pathway (Figure 1) as described in U.S. Patent 
Application Serial No. 09/400,494 and U.S. Patent Application Serial No. 09/667,569. 
For example, strains having a deregulated panBCD operon and/or having deregulated 

1 0 panEl exhibited enhanced pantothenate production (when cultured in the presence of P- 
alanine and a-KIV). Strains further deregulated for ilvBNC and ilvD exhibited enhanced 
pantothenate production in the presence of only P-alanine. Moreover, it was possible to 
achieve p-alanine independence by further deregulating panD, 

An exemplary strain is PA824, a tryptophan prototroph, Spec and Tet 

1 5 resistant, deregulated for panBCD at the panBCD locus, deregulated for panEl at the 
panEl locus (two genes in the B. subtilis genome are homologous to E, colipanE, 
panEl and panE2, the former encoding the major ketopantoate reductase involved in 
pantothenate production, while panE2 does not contribute to pantothenate synthesis 
(U.S. Patent Application Serial No. 09/400,494), deregulated for ilvD at the ilvD locus, 

20 overexpressing an ilvBNC cassette at the amyE locus, and overexpressing panD at the 
bpr locus. 

The production of pantothenic acid by PA824 was investigated in 14 L 
fermentor vessels. The composition of the batch and feed media are as follows. • 

25 BATCH 





MATERIAL 


g/L (final) 


1 


Yeast extract 


10 


2 


Na Glutamate 


5 


3 


(NH4)2S04 


8 


4 


KH2P04 


5 


5 


K2HP04 


7.6 



Addded After Sterilization and Cool Down 
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1 


Glucose 


2.5 


2 


CaClj 


0.1 


3 


MgClj 


1 


4 


Sodium Citrate 


1 


5 


FeS04-7H20 


0.01 


5 


SM-IOOOX 


1ml 



The final volume of the batch medium is 6 L. The trace element solution 
SM-IOOOX has following composition: 0.15 gNa2Mo04-2 H2O, 2.5 g H3BO3, 0.7 g 
CoCl2-6 H20, 0.25 g CuS04-5 H20, 1.6 g MnCl24 H20, 0.3 g ZnS04-7 H2O are dissolved 
5 in water (final volume 1 L). 

The batch medivim was moculated with 60 ml of shake flask PA824 
culture (OD=10 in SVY medium: Difco Veal Infusion broth 25 g, Difco Yeast extract 5 
g. Sodium Glutamate 5 g, (NH4)2S04 2.7 g in 740 ml H.O, autoclave; add 200 ml sterile 
1 M K2HPO4 (pH 7) and 60 ml sterile 50% Glucose solution (final volume IL)). The 
1 0 fermentation was run at 43 °C at an air flow rate of 1 2 L/min as a glucose limited fed 
batch. The initial batched glucose (2.5 g/L) was consumed during exponential growth. 
Afterwards glucose concentrations were maintained between 0.2-1 g/L by continuous 
feeding of FEED solution as follows. 



FEED 





MATERIAL 


g/L (final) 


1 


Glucose 


550 


2 


CaClj 


0.1 


3 


SM-IOOOX 


3ml 



The variable feed rate pump was computer controlled and linked to the 
glucose concentration in the tank by an algorithm. In this example the total feeding was 
6L. 

During fermentation the pH was set at 7.2. Control was achieved by pH 
20 measurements linked to computer control. The pH value was maintained by feeding 
either a 25% NHj-solution or a 20% H3P04-solution. NHj acts simultaneousely as a N- 
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source for the fermentation. The dissolved oxygen concentration [PO2] was set at 30% 

by regulation of the agitation and aeration rate. Foaming was controlled by addition of 

silicone oil. After the stop of the addition of the feed solution, in this example after 48 

h, the fermentation was continued until the [pOs] value reached 95%. Then the 

5 fermentation was stopped by killing the microorganism through sterilization for 30 min. 

The successful sterilization was proven by plating a sample of the fermentation broth on 

agar plates. The pantothenate titer in the fermentation broth was 21 .7 g/L after 

sterilization and removal of the cells by centriftigation (determined by HPLC analysis). 

For HPLC analysis the fermentation broth sample was diluted with sterile 

10 water (1 :40). 5 |li1 of this dilution was injected into a HPLC column (Aqua CI 8, 5|im, 

150*2.0 mm, Phenomenex™). Temperature of the column was held at 40°C. Mobile 

phase A was 14.8 mM H3PO3, mobile phase B 100% Acetonitrile. Flow rate was 

constant at 0.5 mL/min. A gradient was applied : 

start: 2% mobile phase B 

15 0-3 min linear increase to 3% mobile phase B 

3-3.5 min linear increase to 20% mobile phase B 

The detection was carried out by an UV-detector (210 nm). Run time 

was 7 min with an additional 3 min posttime. The retention time for pantothenic acid is 

3.9 minutes. The HPLC chromatogram for the above mentioned sample is given in 

20 Figure 4. 

Identification of compound related to peak with retention time 4. 7 minutes 

Under the described fermentation conditions, PA824 routinely yields 
approximately 20-30 g/L pantothenate. In addition to producing significant quantities of 

25 pantothenate, it was discovered a second compoiond eluted with an approximate 

retention time of 4.7 minutes in this system. The second prominent product formed in 
the fermentation was shown to be [R]-3-(2-hydroxy-3-methyl-butyrylamino)-propionic 
acid (HMBPA) (also referred to herein as "P-alanine 2-(i?)-hydroxyisolvalerate", "p- 
alanine 2-hydroxyisolvalerate", and/or "p-alanyl-a-hydroxyisovalarate). It was 

30 identified by its mass spectrum (Figure 5; relative monoisotopic mass 189), ^H- and 
13C-NMR (data not shown) after chromatographic purification by reverse phase flash 
chromatography (mobile phase 10 mM KH2PO4, with increasing contents of acetonitrile 
(1-50%)). 

35 
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In order to verify the identity of the compound, deliberate synthesis of 
racemic p-alanine 2-hydroxyisolvalerate was performed as follows. P-alanine (2.73 g / 
30 mmol) and sodium methoxide (5.67 g of a 30% solution in methanol / 31.5 mmol) 
were dissolved in methanol (40 mL). Methyl 2-hydroxyisovalerate (2-hydroxy-3- 
5 methylbutyric acid methyl ester) (3.96g / 30 mmol) was added and refluxed for 1 8 hours. 
Methanol was then removed by rotavap and replaced by tert-butanol (50 mL). 
Potassium tert-butoxide was added (50 mg) and refluxed for 26 hours. The solvent was 
removed in vacuo, the residue dissolved in water (50 mL) and passed through a strongly 
acidic ion-exchange resin (H+-form Lewatite™ S 100 Gl; 100 mL). More water is used 

10 to rinse the ion exchanger. The aqueous eluates are combined and the water removed in 
vacuo. The residue is subjected to flash chromatography (silica gel; 2% acetic acid in 
ethyl acetate as eluent) and the product fractions evaporated to give a soUd residue. The 
residue was recrystalUzed from ethyl acetate / toluene (10 mL / 20 mL, respectively) and 
analytically pure HMBPA (P-alanine 2-hydroxyisolvalerate) was obtained, which 

1 5 showed a relative monoisotopic mass of 1 90 ( 1 89 + HT) in the mass spec and the same 
^H-NMR resonances as the product obtained from fermentation. 

The biosynthetic pathway resulting in HMBPA production is set forth in 
Figure 2. The chemical structure of [R]-3-(2-hydroxy-3-methyl-butyrylamino)- 
propionic acid (HMBPA) is depicted in Figure 3. As depicted in Figure 2, HMBPA is 

20 the condensation product of a-hydroxyisovaleric acid (a-HIV) and P-alanine, catalyzed 
by the PanC enzyme. a-HIV is generated by reduction of a-KIV, a reaction which is 
catalyzed by the reductases PanE {e.g., PanEl and/or PanE2) and/or IlvC. 

Based on the chemical structure and biosynthetic pathway leading to 
HMBPA production, the present inventors formulated the following model to describe 

25 the interaction between the previously known pantothenate and isoleucine-valine (ilv) 
pathways and the newly characterized HMBPA biosynthetic pathway. In at least one 
aspect, the model states that there exist at least two pathways in microorganisms that 
compete for a-KIV, the substrate for the biosynthetic enzyme PanB, namely the 
pantothenate biosynthetic pathway and the HMBPA biosynthetic pathway. (A third and 

30 fourth pathway competing for a-KIV are those resulting in the production of valine or 
leucine from a-KIV, see e.g.. Figure 1). At least the pantothenate biosynthetic pathway 
and the HMBPA biosynthetic pathway further produce competitive substrates for the 
enzyme PanC, namely a-HIV and pantoate. The model predicts that reducing PanB 
activity will increase a-KIV availability for a-HIV synthesis (and ultimately, HMBPA 
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synthesis) and decrease the amount of pantoate and/or pantothenate synthesized by a 
microorganism. Conversely, increasing PanB activity will increase pantoate and 
ketopantoate availability for pantoate/pantothenate synthesis. The following examples 
provide expermental support for the model and further exemplify processes for 
5 increasing the production of HMBPA based on the model. 

EXAMPLES II-VI: 

For Examples II-VI, quanitation of pantothenate and/or HMBPA was 
10 performed as follows. Ahquots of fermentation media were diluted 1 :100 and aliquots 
of test tube cultures were diluted 1 : 1 0 in water or 5% acetonitrile prior to injection on a 
Phenomenex Aqua™ 5|a C18 HPLC column (250 x 4.60mm, 125 A). Mobile phases 
were A = 5% acetonitrile, 50 mM monosodium phosphate buffer adjusted to pH 2.5 with 
phosphoric acid; and B == 95% acetonitrile, 5% HjO. 
1 5 Linear gradients were as follows. 



Minutes 


Solvent A 


Solvent B 


0 


100% 


0% 


16 


100% 


0% 


17 


0% 


100% 


20 


0% 


100% 


21 


100% 


0% 



Additional parameters and apparatus were as follows: Flow rate = 1.0 
ml/min; Injection volume = 20 |j.l; Detector = Hewlett Packard 1090 series DAD UV 
20 detector-3014, Signal A = 197 nm, ref = 450 nm, Firmware revision E; Column heater = 
Oven tempature 40''C; Hardware = Hewlett Packard Kayak™ XA; and Software = 
Hewlett Packard Chemstation Plus™ family revision A.06.03[509]. 

Under these fermentation conditions, PA824 routinely yields 
approximately 30-40 g/L pantothenate. HMBPA elutes at approximately 13 minutes in 
25 this system. 
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Example II: Ketopantoate Reductase Contributes to the Production of HMBPA 
and Increasing Ketopantoate Reductase Activity in Bacillus Results 
in Enhanced HMBPA Production 

5 As described in Example I, a novel HPLC peak corresponding to 

HMBPA was observed in microorganisms overexpressing panEl indicating that 
increased ketopantoate reductase contributes to the production of HMBPA (in addition 
to production of pantothenate). As mentioned previously, two genes in the B. subtilis 
genome are homologous to the E, colipanE gene encoding ketopantoate reductase and 

10 have been named panEl and panE2. In Bacillus^ the panEl gene encodes the major 
ketopantoate reductase involved in pantothenate production, while panE2 does not 
contribute to pantothenate synthesis. In fact, overexpression of panE2 from a P26 
promoter leads to a reduction in pantothenate titer (see e.^., U.S. Patent Application 
Serial No. 09/400,494). 

15 Accordingly, it was tested whether, beside being produced by the panEl 

gene product, it was possible that a significant portion of the a-HIV necessary to make 
HMBPA was being produced by the panE2 gene product. It was hypothesized that the 
panE2 gene product is an enzyme that can reduce a-KIV to a-HIV, but that can not 
significantly reduce ketopantoate to pantoate. 

20 To test the hypothesis, /7a;2£2 was deleted from pantothenate production 

strain PA824 (described in Example I) by transforming with a ApanE2::cat cassette 
from chromosomal DNA of strain PA248 (ApanE2::cat) (set forth as SEQ ID NO:24, 
for construction see e.g., U.S. Patent Application Serial No. 09/400,494) to give strain 
PA919. Three isolates of PA919 were compared to PA824 for pantothenate and 

25 HMBPA production in test tube cultures grown in SV Y plus p-alanine. 
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As indicated by the data in Table 1, all three isolates of PA919 produced 
about four-fold lower HMBPA than PA824 demonstrating that the panE2 gene product 
is a potent contributor to HMBPA synthesis. Moreover, significant increases in 
5 HMBPA production can be achieved simply by overexpression of panE2. An exemplary 
plasmid for the overexpression of panE2, named pAN238, is set forth as SEQ ID NO:25 
(Figure 10). 



1 0 Example III. Increasing Production of HMBPA by Reducing PanB Activity in 
Microorganisms. 

Strains derived from PA365 (the RL-1 lineage equivalent of P A3 77, 
described in U.S. Patent AppUcation Serial No. 09/667,569) which are deleted for the 
P26 panBCD cassette and which contain a P26 pcinC^D cassette amplified at the vpr 

1 5 locus and either the wild type P26 P^^B cassette (PA666) or a P26 ^pcinB cassette 

(PA664) amplified at the bpr locus were constructed as follows. An alignment of the C- 
terminal amino acids of known or suspected PanB proteins is shown in Figure 6. Tluee 
regions called 1, 2 and 3, that were identified having conserved or semi-conserved amino - 
acid residues, are indicated by arrows at the top of the figure. The B. subtilis PanB 

20 protein (RBS02239) is underlined. Two of the PanB proteins (RCY14036 and 

CAB56202.1) are missing region 3 while the latter PanB protein is also missing region 2 
and has non-conserved amino acid residues occupying region 1 . 

B, subtilis PanB variants were created that were missing regions 1, 2 and 
3. The desired variants were created by designing 3' PGR primers to amphfy the B. 

25 subtilis pan B gene such that region 3, regions 2 and 3, or all three regions would be 
missmg fi^om the final product. The PGR products were generated and cloned into E. 
coli expression vector p ASK- IB A3, creating plasmids pAN446, pAN447, and pAN448, 
respectively. The plasmids were then transformed into E, coli strain S J2 that contains 
the panB6 mutation to test for complementation. Only pAN446, which is missing region 

30 3, was able to complement. This indicates that region 3 is not essential for B, subtilis 
PanB activity but that region 2 is required for activity or stability. 
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The next step in this analysis was to transfer the panB gene from pAN446 
to a B. subtilis expression vector and then introduce it into a strain appropriate for 
testing activity of the encoded PanB protein in B. subtilis. To do this, a strain that is 
deleted for the P26 ponBCD operon was first created. This was accomplished by first 
5 inserting a cat gene between the ^^-eRI site located just upstream of the panB RBS and . 
the Bg/ll site located in panD, creating plasmid pAN624 (Figure 7). The sequence of 
pAN624 is set forth as SEQ ID NO:20. The resulting deletion-substitution mutation 
{ApanBCD::cat624\ which removes all of panB and panC, was crossed into PA354 by 
transformation, with selection for resistance to chloramphenicol on plates supplemented 

10 with 1 mM pantothenate. One of the transformants was saved and named PA644. 
Chromosomal DNA isolated from PA644 was analyzed by PGR and was shown to 
contain the deletion-substitution mutation. As expected, PA644 requires pantothenate 
for growth but retains the engineered ilv genes (^^S^BNC V^S'^D) as well as the V^epan 
El gene originally present in PA354. Thus, it has all the enzymes involved in pantoate 

15 synthesis overproduced except PanB. The gene containing the shortest panB deletion 
was inserted into B. subtilis expression vector pOTP61 (described in US patent 
application Serial No. 09/667,569), creating plasmid pAN627. At the same time, a wild- ■ 
type panB control gene was inserted into pOTP61, creating plasmid pAN630. The Not] 
fragments of each plasmid, lacking E. coli vector sequences, were ligated and . 

20 transformed into PA644, with selection for resistance to tetracycline. 

One transformant from each transformation was saved and fiirther 
transformed with chromosomal DNA from PA628 with selection for Pan^. PA628 
contains a multicopy ?2^anC'^D expression plasmid (pAN620) integrated at the vpr 
locus. In order to determine the effects of the panB gene mutation directly on 

25 pantothenate production, plasmid pAN620, set forth as SEQ ID N0:21 and illustrated 
schematically in Figure 8, provides the remaining two enzymes required for 
pantothenate synthesis (PanC and PanD). Four transformants from each transformation 
were isolated, grown in SVY medium containing 10 g/L aspartate for 48 hours, then 
assayed for pantothenate production. Transformants with the 3 'deleted panB gene were 

30 named PA664 and those containing the wild-type gene were called PA666. The data 
showed that the V deleted panB gene in PA664 encodes a PanB protein with greatly 
reduced activity. To test for HMBPA production, test tube cultures of PA365, PA666, 
and PA664 were grown in SVY + aspartate medium with and without added a-KIV or 
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pantoate for 48 hours and then assayed for HMBPA and pantothenate as described 
previously. 
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The data presented in Table 2 demonstrate that in the absence of 
supplements, PA664 produced the most HMBPA while PA666 produced the least, 
indicating an inverse correlation between PanB activity and HMBPA production. This 
5 is consistent with the model which predicts that the two pathways compete for a-KIV, 
the substrate for PanB, and produce competitive substrates for PanC; lowering PanB 
activity would be expected to increase a-KIV availability for a-HIV synthesis and 
increase HMBPA production, correspondingly decreasing the amount of pantoate 
synthesized. When a-KIV is added to the medium, all three strains produced 

1 0 significantly more HMBPA. This result evidences that a-KIV is a precursor to 

HMBPA, as described in Figure 2, and that excess a-KIV favors HMBPA production. 
This resuh also suggests that synthesis of HMBPA is at least partially due to an overflow 
effect of excess a-KIV production. When pantoate was added to the medium, HMBPA 
was reduced by roughly 50 percent in all three strains. Conversely, the strains each 

15 produced significantly more pantothenate. This result is also consistent with the model 
that the two pathways produce competing substrates for PanC (a-HIV and pantoate). 
Taken together, the above results fiirther indicate that decreasing pantoate synthesis 
should be beneficial in promoting HMBPA production as well as reducing pantothenate 
levels. 

20 

Example IV. Methods for Regulating HMBPA:Paiitothenate Levels 

As demonstrated in Examples I and II, PanEl and/or PanE2 
contribute to enhanced HMBPA production as does reduced PanB activity. This- - 
Example demonstrates that overexpressing PanEl increases HMBPA production relative 
25 to pantothenate production whereas overexpressing PanB decreases HMBPA production 
relative to pantothenate production. Furthermore, in strains overexpressing IlvC, 
HMBPA production is enhanced. 

PA668 is a derivative of PA824 that contains extra copies of P26 P^^B 
amplified at the vpr or panB locus. PA668 was constructed using the panB expression 
30 vector (pAN636) which allows for selection of multiple copies using chloramphenicoL 
The sequence of pAN636 is set forth as SEQ ID NO:22 and the vector is depicted 
schematically in Figure 9. The pAN636 JVbr/ restriction firagment, missing the E, coli 
vector sequences, was ligated and then used to transform PA824 with selection on plates 
contailing 5 p.g/ml chloramphenicol. Transformants resistant to 30 )xg/ml 
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chloramphenicol were isolated and screened for pantothenate production in 48 hour test 
tube cultures. The isolates shown produce less HMBPA that PA824 (conversely 
producing about 10 percent more pantothenate than PA824). A second strain, called 
PA6695 was constructed which is PA824 with extra copies of P26 pcinEl ampHfied at 
5 the vpr or panEl locus. Strain PA669 was constructed by transforming PA824 with the 
self-Ugated Notl fragment of plasmid pAN637 with selection for resistance to 
chloramphenicol. The sequence of pAN637 is set forth as SEQ ID NO:23 and the vector 
is depicted schematically in Figure 10. Two isolates of PA669 were chosen for further 
study; PA669-5 produces less PanEl than PA669-7 as judged by SDS-PAGE analysis of 
1 0 total cell extracts made from the two strains. 

Test tube cultures of strains PA824, PA668-2, PA668-24, and the two 
isolates of PA669 (PA669-5 and PA669-7) were grown in three different media (SVY, 
SVY + aspartate, and SVY + aspartate + pantoate) for 48 hours and then assayed for 
pantothenate, HMBPA, and p-alanine (Table 3), 
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None of the strains produced detectable quantities of HMBPA in SVY 
medium. All strains produced roughly equivalent amounts of pantothenate and low 
amounts of p-alanine indicating that p-alanine is limiting for both pantothenate and 
5 HMBPA synthesis in these cultures and that p-alanine is a precursor for both 
compounds. When grown in SVY + aspartate medium, the two PA669 isolates 
produced more HMBPA than PA824 whereas both PA668 isolates produced less 
HMBPA than PA824. It is noteworthy that the strain that produces the most PanEl 
(PA669-7) produced the most HMBPA (and the least pantothenate). This suggests that 

1 0 high levels of PanE 1 favor the production of HMBPA at the expense of lower 

pantothenate synthesis. It is also interesting that PA668-24 produced more HMBPA 
than PA668-2, even though SDS-PAGE analysis of extracts from the two strains showed 
that they produce roughly equivalent levels of PanB. The SDS-PAGE analysis also 
showed that PA668-24 makes much more IlvC than PA668-2. Based on these data, it is 

15 proposed that IlvC influences HMBPA synthesis by increasing steady state levels of 
a-KIV and/or by catalyzing a-HIV formation from a-KIV, thereby accounting for the 
observed shift towards production of HMBPA. 

The final set of data in Table 3 shows that adding pantoate to the growth 
medium decreased HMBPA production by all strains that had previously produced 

20 detectable levels, e.g.y by shifting synthesis towards pantothenate. This fiirther supports 
the model that a-HIV and pantoate are competitive substrates for PanC. 

Example V: Increasing HMBPA Production by Limiting Serine Availability 

25 

It was hypothesized that the ratio of pantothenate to HMBPA production 
could also be controlled by regulating the availability of serine or methylene 
tetrahydrofolate in the microorganism cultures. In particular, it is proposed that 
decreasing the availability of serine could increase HMBPA production relative to 

30 pantothenate production, whereas increasing the availability of serine would decrease the 
production of IDvIBPA relative to pantothenate production. This method is based on the 
understanding that the PanB substrate, methylenetetrahydrofolate is derived from serine. 
Thus, regulating serine levels should effectively regulate PanB substrate levels. To test 
this hypothesis, PA824 was grown in test tube cultures of SVY glucose plus 5 g/L P- 

35 alanine and ± 5 g/L serine for 48 hours at 43°C. 
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Table 4: Production o/HMBPA and pantothenate by PA824 with and without 
the addition of serine 



serine added 
at5g/L 




[pan]g/L 


[HMBPA]g/L 




16.3 


4.9 


0.84 




14.0 


4.5 


0.80 


+ 


13.1 


6.4 


0.56 


+ 


12.9 


6.0 


0.62 



5 

As demonstrated in Table 4, addition of serine decreases the level of 
production while conversely increasing pantothenate production. At least one method of 
decreasing methylene tetrahydrofolate levels in order to regulate HMBPA production 
levels is to decrease the activity of serine hydroxymethyl transferase (the glyA gene 
1 0 product), thereby decreasing metiiylene tetrahydrofolate biosynthesis in appropriately, 
engineered microorganisms. At least one method of decreasing serine levels in order to 
regulate HMBPA production is to decrease the activity of 3-phosphoglycerate 
dehydrogenase (the serA gene product). 

15 

Example VI: Increasing HMBPA Production by Modifying Culture Conditions for 
Recombinant Microorganisms 

In at least one fermentation (Fermentation P 1 62), levels of HMBPA 
production reached 35 g/L. Briefly, fermentation of strain PA824 was carried out as 
20 described in Example I but utili2ang PFM-155 medium formulated as follows. 



BATCH 





MATERIAL 


g/L (fmai) 


1 


Amberex 1003 


5 


2 


Cargill 200/20 (soy flour) 


40 


3 


Na Glutamate 


5 


4 


(NHa),SO. 


8 


5 


MgSOi-THjO 


1 
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6 


MAZU DF204C 


1 


7 


HjO 


qs to 4 L 


Added After Sterilization and Cool Down 








1 


KH2PO4 


10 


2 


K2HP04-3H2O 


20 


. 3 


H2O 


qs to 400 ml 








1 


80% Glucose 


20 


2 


CaCl2-2H20 


0.1 








1 


Sodium Citrate 


1 


2 


FeS04-7H20 


0.01 


3 


SM-IOOOX 


IX 








FEED 




MATERIAL 


g/L (final) 


1 


80% Glucose 


800 


2 


CaCl2-2H20 


0.8 


3 


H2O 


qs to 3500 ml 


Added After Sterilization and Cool Down 








1 


Sodium Citrate 


2.0 


2 


FeS04-7H20 


0.02 


3 


SM-IOOOX 


2X 








4 


Glutamate Na 


5.0 


5 


H2O 


qs to 500 ml 



5 

However, as a result of loss of process control during the fermentation, 
the dissolved oxygen became limitiag between 16 and 17 hours and glucose began to 
accumulate after 16 hours. 
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These changes m fermentation conditions produced the following 
significant results at or after 16 hours. Namely, synthesis of HMBPA began to increase 
with a corresponding decrease in pantothenate synthesis. In the four hour interval before 
16 hours the culture produced 7 g/I HMBPA, four hours afterwards, 9.0 g/1. 
5 Pantothenate was the reverse with 10 g/l and 6.0 g/1 produced between 12-16 hours and 
16-20 hours, respectively. Between 20 and 36 hours the average rate of HMBPA 
synthesis was 1.0 g/1 hr. Overall, fermentation P162 produced 35 g/1 of HMBPA in 36 
hours. 

Thus, it appears that overfeeding of glucose, and/or limitation of 
10 dissolved oxygen (e.g. , beginning at about 1 6 hours) leads to an increase in HMBPA 
production. Accordingly, two methods for increasing HMBPA production (relative to 
pantothenate production) are to increase steady state glucose levels and/or decrease 
steady state dissolved oxygen levels. 



Equivalents Those skilled in the art will recognize, or be able to ascertain using no 
more than routine experimentation, many equivalents to the specific embodiments of the 
invention described herein. Such equivalents are intended to be encompassed by. the 
20 following claims. 
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What is claimed: 

1 . A process for the production of 3-(2-hydroxy-3-methyl- 
butyrylamino)-propionic acid (HMBPA), comprising culturing a microorganism under 

5 conditions such that HMBPA is produced and detecting the HMBPA produced by said 
microorganism. 

2. A process for the production of 3 -(2-hydroxy-3 -methyl- 
butyrylamino)-propionic acid (HMBPA), comprising culturing a microorganism under 

1 0 conditions such that HMBPA is produced and isolating the HMBPA produced by said 
microorganism. 

3. A process for the production of 3-(2-hydroxy-3-methyl- 
butyrylamino)-propionic acid (HMBPA), comprising culturing a microorganism having 

15 increased keto reductase activity or increased pantothenate synthetase activity in the 
presence of excess a-ketoisovalerate and excess P-alanine, such that HMBPA is 
produced. 

4. A process for the production of 3-(2-hydroxy-3-methyl- 

20 butyrylamino)-propionic acid (HMBPA), comprising culturing a microorganism having 
increased keto reductase activity and increased pantothenate synthetase activity in the 
presence of excess a-ketoisovalerate and excess P-alanine, such that HMBPA is 
produced. 

5. The process of claim 3 or 4, wherein said microorganism 
25 comprises a modified panE gene. 

6. The process of claim 5, wherein the panE gene is overexpressed, 
deregulated or present in multiple copies. 

30 7. The process of claim 3 or 4, wherein said microorganism 

comprises a modified panEl gene. 

8. The process of claim 3 or 4, wherein said microorganism 
comprises a modified panE2 gene. 
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9. The process of claiin 3 or 4, wherein said microorganism 
comprises a modified panEl gene and a modified panE2 gene. 

5 10. The process of claim 3 or 4, wherein said microorganism 

comprises a modified panC gene. 

11. The process of claim 3 or 4, wherein the panC gene is 
overexpressed, deregulated or present in muhiple copies. 

10 

12. The process of claim 3 or 4, wherein said microorganism further 
has increased acetohydroxyacid isomeroreductase activity. 

13. A process for the production of 3-(2-hydroxy-3-methyl- 

1 5 butyrylamino)-propionic acid (HMBPA), comprising culturing a microorganism having 
increased acetohydroxyacid isomeroreductase activity in the presence of excess a- 
ketoisovalerate and excess p-alanine, such that HMBPA is produced. 

14. The process of claim 12 or 13, wherein said microorganism 
20 comprises a modified ilvC gene. 

15. The process of claim 14, wherein the ilvC gene is overexpressed, 
deregulated or present in multiple copies. 

25 1 6. The process of any one of claims 3, 4 or 1 1 , wherein said 

microorganism fiirther has reduced ketopantoate hydroxymethyltransferase activity. 

17. The process of claim 16, wherein said microorganism comprises a 
modified panB gene. 

30 

1 8. The process of claim 1 6, wherein said microorganism has been 
deleted for the panB gene. 
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10 



30 



19. A process for the production of 3-(2-hydroxy-3-methyl- 
butyryIamino)-propionic acid (HMBPA), comprising culturing a microorganism having 
reduced ketopantoate hydroxymethyltransferase activity in the presence of excess a- 
ketoisovalerate and excess p-alanine, such that HMBPA is produced. 

20. A method for enhancing production of 3-(2-hydroxy-'3-methyl- 
butyrylaniino)-propionic acid (HMBPA) relative to pantothenate, comprising culturing a 
recombinant microorganism under conditions such that the HMBPA production is 
enhanced relative to pantothenate production. 



21 . A process for the production of 2-hydroxyisovaleric acid (a-HIV), 
comprising culturing a microorganism which overexpresses PanEl or PanE2 and which 
further has reduced PanC or PanD activity under conditions such that a-HIV is 
produced. 

15 22. A process for the production of 3-(2-hydroxy-3-methyl- 

butyrylamino)-propionic acid (HMBPA), comprising culturing a recombinant 
microorganism having decreased expression or activity of ser A or glyA under conditions 
such that HMBPA is produced. 

20 23 . A process for the production of 3-(2-hydroxy-3-methyl- 

butyrylamino)-propionic acid (HMBPA), comprising culturing a recom.binant 
microorganism having decreased expression or activity of serA and glyA under 
conditions such that HMBPA is produced. 

25 24. The process of any one of the proceeding claims whereia the 

microorganism is cultured vmder conditions of increased steady state glucose. 

25. The process of any one of the proceeding claims wherein the 
microorganism is cultured under conditions of decreased steady state dissolved oxygen. 



26. The process of any one of the proceeding claims wherein the 
microorganism is cultured under conditions of decreased serine. 

27. A product produced according to any one of the above claims. 
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28. A recombinant microorganism that produces 3-(2-hydroxy-3- 
methyl-butyrylamino)-propiomc acid (HMBPA), the microorganism having a 
modification in at least one gene encoding ketopantoate reductase that resuhs in 
increased reductase activity and having a mutation or deletion m the panB gene that 
results in reduced ketopantoate hydroxymethyltransferase activity. 

29. The recombinant microorganism of claim 28, wherein the gene 
encoding ketopantoate reductase is a panE gene. 

30. The recombinant microorganism of claim 29, v^herein the panE 

gene is panEL 

3 1 . The recombinant microorganism of claim 29, wherein the panE 

gene is panE2. 

32. The recombinant microorganism of claim 28, wherein the 
microorganism has a modification in panEl and panE2. 

33. The recombinant microorganism of claim 28, further having a 
modification in ilvC that results in increased acetohydroxyacid isomeroreductase 
activity. 

34. A recombinant microorganism that produces 3-(2-hydroxy-3- 
methyl-butyrylamino)-propionic acid (HMBPA), the microorganism havmg a 
modification in ilvC that results in increased acetohydroxyacid isomeroreductase activity 
and having a mutation or deletion in the panB gene that results in reduced ketopantoate 
hydroxymethyltransferase activity. 

35. The recombinant microorganism of any one of claims 28 to 34, 
wherein said microorganism belongs to the genus Bacillus. 

36. The recombinant microorganism of claim 35, which is Bacillus 

subtilis. 
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FIG. 2 
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FIG. 3 
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FIG. 6 
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SEQUENCE LISTING 
<110> OmniGene BioProducts Inc., e/a/. 

<120> METHODS AND MICROORGANISMS FOR THE PRODUCTION OF 3-(2- 
HYDROXY-S-METHYL-BiTYRLAMINO) -PROPIONIC ACID (HMBPA) 

<130> BGI~146PC 

<140> 
<141> 

<160> 25 

<170> PatentIn Ver. 2.0 

<210> 1 
<211> 194 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : promoter 
sequence 

<220> 

<221> -35_signal 
<222> (136) (141) 

<220> 

<221> -10_signal 
<222> (159) . . (164) 

<400> 1 

gctattgacg acagctatgg ttcactgtcc accaaccaaa actgtgctca gtaccgccaa 60 

tatttctccc ttgaggggta caaagaggtg tccctagaag agatccacgc tgtgtaaaaa 120 

ttttacaaaa aggtattgac tttccctaca gggtgtgtaa taatttaatt acaggcgggg 180 
gcaaccccgc ctgt 194 



<210> 2 
<211> 163 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: promoter 
sequence 

<220> 

<221> -35_signal 
<222> (113).. (118) 

<220> 

<221> -10_signal 
<222> (136) (141) 
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<400> 2 

gcctacctag cttccaagaa agatatccta acagcacaag agcggaaaga tgttttgttc 60 
tacatccaga acaacctctg ctaaaattcc tgaaaaattt tgcaaaaagt tgttgacttt 120 



<210> 3 
<211> 127 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence : promoter 



<220> 

<221> -35_signal 
<222> (34) . . (39) 

<220> 

<221> ~10_signal 
<222> (58).. (63) 

<220> 

<221> ~35_signal 
<222> (75) . . (80) 

<220> 

<221> -10_signal 
<222> (98) . . (103) 

<400> 3 

gaggaatcat agaattttgt caaaataatt ttattgacaa cgtcttatta acgttgatat 60 

aatttaaatt ttatttgaca aaaatgggct cgtgttgtac aataaatgta gtgaggtgga 120 
tgcaatg 127 



<210> 4 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence rribosome 
binding site 



atctacaagg 



tgtggtataa taatcttaac aacagcagga cgc 



163 



sequence 



<400> 4 

taaacatgag gaggagaaaa catg 



24 



<210> 5 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence rribosome 
binding site 

<400> 5 

attcgagaaa tggagagaat ataatatg 

<210> 6 
<211> 13 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence rribosome 
binding site 

<400> 6 
agaaaggagg tga 

<210> 7 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence rribosome 
binding site 

<220> 

<223> All occurrences of n = any nucleotide 
<400> 7 

ttaagaaagg aggtgannnn atg 

<210> 8 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence rribosome 
binding site 

<220> 

<223> All occurrences of n - any nucleotide 
<400> 8 

ttagaaagga ggtgannnnn atg 



<210> 9 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequencer ribosome 
binding site 
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<220> 



<223> All occurrences of n = any nucleotide 



<400> 9 

agaaaggagg tgannnnnnn atg 



23 



<210> 10 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : ribosome 
binding site 

<220> 

<223> All occurrences of n = any nucleotide 
<400> 10 

agaaaggagg tgannnnnna tg 22 

<210> 11 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : ribosome 
binding site 



<210> 12 
<211> 24 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence : ribosome 
binding site 

<400> 12 

ccctctagag gaggagaaaa catg 24 

<210> 13 

<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: ribosome 
binding site 



<400> 11 

ccctctagaa ggaggagaaa acatg 



25 



<400> 13 



ttagaaagga ggatttaaat atg 



23 



<210> 14 
<211> 23 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence :ribosome 
binding site 

<400> 14 

ttagaaagga ggtttaatta atg 

<210> 15 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence :ribosome 
binding site 

<400> 15 

ttagaaagga ggtgatttaa atg 

<210> 16 

<211> 23 
<212> DNA 

<213> Artificial Sequence 
<22d> 

<223> Description of Artificial Sequence: ribosome 
binding, site 

<400> 16 

ttagaaagga ggtgtttaaa atg 

<210> 17 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : ribosome 
binding site 

<400> 17 

attcgagaaa ggaggtgaat ataatatg 

<210> 18 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: ribosome 
binding site 

<400> 18 

attcgagaaa ggaggtgaat aataatg 

<210> 19 
<211> 28 
<212> DNA 
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<220> 

<223> Description of Artificial Sequence zribosome 
binding site 

<400> 19 

attcgtagaa aggaggtgaa ttaatatg 28 



<210> 20 
<211> 6886 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: vector 
<220> 

<223> pAN624 
<400> 20 

aagaaaccaa ttgtccatat tgcatcagac attgccgtca ctgcgtcttt tactggctct 60 

tctcgctaac caaaccggta accccgctta ttaaaagcat tctgtaacaa agcgggacca 120 

aagccatgac aaaaacgcgt aacaaaagtg tctataatca cggcagaaaa gtccacattg 180 

attatttgca cggcgtcaca ctttgctatg ccatagcatt tttatccata agattagcgg 240 

atcctacctg acgcttttta tcgcaactct ctactgtttc tccatacccg tttttttggg 300 

ctaacaggag gaattaacca tggatccgag ctcgacagta tcaagcactt cacaatctgg 360 

gagctgaaag cccgccttat gtagctcata cttgacaaat ccaaggtcaa aatggatatt 420 

gtgggcgaca aaataagcgc cgtcaagcaa ttggaatact tcttcagcaa ctgcttcaaa 480 

tggctgttca ttctcgacca tttgattaga gattccagta agctgctcaa taaaagcagg 540 

gattgattta tttggattaa tgtattttga aaaccgctca gtaatttgtc cattttcgat 600 

tacaaccgct gcgatttgta tgattttatc gcctttcttc ggcgaattcc ctgttgtctc 660 

tacatctata acaacgaacc gttgcttatt cattaaaatg gacacctcaa ttcttgcata 720 

cgacaaaagt gtaacacgtt ttgtacggaa atggagcggc aaaaccgttt tactctcaaa 780 

atcttaaaag aaaacccccg ataaaggggg cttttcttct acaaaattgt acgggctggt 840 

tcgttcccca gcatttgttc aattttgttt tgatcattca gaacagccac tttcggctca 900 

tggcttgccg cttcttgatc agacatcatt ttgtaggaaa taataatgac cttatctcct 960 
tcctgcacaa ggcgtgcggc tgcaccgttt aagcatatga cgccgcttcc 'ccgtttacca 1020 

ggaataatat acgtttcaag acgtgctcca ttattattat tcacaatttg tactttttca 1080 

ttaggaagca ttcccacagc atcaatgaga tcctctagag tcgacctgca ggcatgcaag 114 0 

cttccgtcga cgctctccct tatgcgactc ctgcattagg aagcagccca gtagtaggtt 1200 

gaggccgttg agcaccgccg ccgcaaggaa tggtgcatgc aaggagatgg cgcccaacag 1260 

tcccccggcc acggggcctg ccaccatacc cacgccgaaa caagcgctca tgagcccgaa 1320 

gtggcgagcc cgatcttccc catcggtgat gtcggcgata taggcgccag caaccgcacc 1380 

tgtggcgccg gtgatgccgg ccacgatgcg tccggcgtag aggatcaatc ttcatccatt 1440 

ccaaggtaaa tcccccttcg ccgtttctgt taccattata caccttttga accttaacgt 1500 

aaacgttaag ttttaaaaaa caataaaaaa gacgagcagc atacagcacc cgtctttcac 1560 

tttcctgttt aagctaaact tcccgccact gacagagact ctttttgaag gctttcagaa 1620 

agcactcgat acgcgatctg gagctgtaat ataaaaacct tcttcaacta acggggcagg 1680 

ttagtgacat tagaaaaccg actgtaaaaa gtacagtcgg cattatctca tattataaaa 1740 

gccagtcatt aggcctatct gacaattcct gaatagagtt cataaacaat cctgcatgat 1800 

aaccatcaca aacagaatga tgtacctgta aagatagcgg taaatatatt gaattacctt 1860 

tattaatgaa ttttcctgct gtaataatgg gtagaaggta attactatta ttattgatat 1920 

ttaagttaaa cccagtaaat gaagtccatg gaataataga aagagaaaaa gcattttcag 1980 

gtataggtgt tttgggaaac aatttccccg aaccattata tttctctaca tcagaaaggt 2040 

ataaatcata aaactctttg aagtcattct ttacaggagt ccaaatacca gagaatgttt 2100 

tagatacacc atcaaaaatt gtataaagtg gctctaactt atcccaataa cctaactctc 2160 

cgtcgctatt gtaaccagtt ctaaaagctg tatttgagtt tatcaccctt gtcactaaga 2220 

aaataaatgc agggtaaaat ttatatcctt cttgttttat gtttcggtat aaaacactaa 2280 
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tatcaatttc tgtggttata 
cttttctctt ccaattgtct 
tttttatcta aagtgaattt 
ttttttaaaa gtcaatatta 
ccaattttcg tttgttgaac 
aatgtggtct tttgtgtttt 
tatcttgata ataagggtaa 
tcttctgcaa gccaaaaaac 
cggcgagttt catccctctg 
ctatctattt taataatagg 
aatcaacggg cttgtaccaa 
aatgccttcg ttcgtttcta 
gtaaaaggtt ccgtttaacg 
ttcccataaa agcttaatcg 
aaagcatagt aaaatatgct 
gctgaggctt gtcgcgatgt 
cggtctccag cttggctgtt 
atcagaacgc agaagcggtc 
cccacctgac cccatgccga 
gtctccccat gcgagagtag 
aagactgggc ctttcgtttt 
atccgccggg agcggatttg 
gcccgccata aactgccagg 
ttgcgtttct acaaactctt 
catgagacaa taaccctgat 
tcaacatttc cgtgtcgccc 
tcacccagaa acgctggtga 
ttacatcgaa ctggatctca 
ttttccaatg atgagcactt 
cgccgggcaa gagcaactcg 
ctcaccagtc acagaaaagc 
tgccataacc atgagtgata 
gaaggagcta accgcttttt 
ggaaccggag ctgaatgaag 
aatggcaaca acgttgcgca 
acaattaata gactggatgg 
tccggctggc tggtttattg 
cattgcagca ctggggccag 
gagtcaggca actatggatg 
taagcattgg taactgtcag 
tcatttttaa tttaaaagga 
cccttaacgt gagttttcgt 
ttcttgagat cctttttttc 
accagcggtg gtttgtttgc 
cttcagcaga gcgcagatac 
cttcaagaac tctgtagcac 
tgctgccagt ggcgataagt 
taaggcgcag cggtcgggct 
gacctacacc gaactgagat 
agggagaaag gcggacaggt 
ggagcttcca gggggaaacg 
acttgagcgt cgatttttgt 
caacgcggcc tttttacggt 
tgcgttatcc cctgattctg 
tcgccgcagc cgaacgaccg 
gatgcggtat tttctcctta 
cagtacaatc tgctctgatg 
gactgggtca tggctgcgcc 
tgtctgctcc cggcatccgc 
cagaggtttt caccgtcatc 



ctaaaagtcg tttgttggtt 
aaatcaattt tattaaagtt 
aggaggctta cttgtctgct 
ctgtaacata aatatatatt 
taatgggtgc tttagttgaa 
tttaaaggat ttgagcgtag 
ctattgcatg ataagctgtc 
cttccgttac aacgagaagg 
tcccagtcct tttttggatc 
tgcagttcgc aggcgatact 
cacattagcc caattcgata 
aaagcagaac gccttcatca 
ttctggctct catattagtg 
gcgtaaatcc gtgcgtcata 
ggatgacgcc ggcccgatca 
ccttcaattc atctggaaaa 
ttggcggatg agagaagatt 
tgataaaaca gaatttgcct 
actcagaagt gaaacgccgt 
ggaactgcca ggcatcaaat 
atctgttgtt tgtcggtgaa 
aacgttgcga agcaacggcc 
catcaaatta agcagaaggc 
tttgtttatt tttctaaata 
aaatgcttca ataatattga 
ttattccctt ttttgcggca 
aagtaaaaga tgctgaagat 
acagcggtaa gatccttgag 
ttaaagttct gctatgtggc 
gtcgccgcat acactattct 
atcttacgga tggcatgaca 
acactgcggc caacttactt 
tgcacaacat gggggatcat 
ccataccaaa cgacgagcgt 
aactattaac tggcgaacta 
aggcggataa agttgcagga 
ctgataaatc tggagccggt 
atggtaagcc ctcccgtatc 
aacgaaatag acagatcgct 
accaagttta ctcatatata 
tctaggtgaa gatccttttt 
tccactgagc gtcagacccc 
tgcgcgtaat ctgctgcttg 
cggatcaaga gctaccaact 
caaatactgt ccttctagtg 
cgcctacata cctcgctctg 
cgtgtcttac cgggttggac 
gaacgggggg ttcgtgcaca 
acctacagcg tgagctatga 
atccggtaag cggcagggtc 
cctggtatct ttatagtcct 
gatgctcgtc aggggggcgg 
tcctggcctt ttgctggcct 
tggataaccg tattaccgcc 
agcgcagcga gtcagtgagc 
cgcatctgtg cggtatttca 
ccgcatagtt aagccagtat 
ccgacacccg ccaacacccg 
ttacagacaa gctgtgaccg 
accgaaacgc gcgaggcagc 



caaataatga ttaaatatct 2340 
catttgatat gcctcctaaa 2400 
ttcttcatta gaatcaatcc 24 60 
ttaaaaatat cccactttat 2520 
gaataaagac cacattaaaa 2580 
cgaaaaatcc ttttctttct 2640 
aaacatgaga attcccgttt 2700 
attcttcact ttctaaagtt 2760 
aaggcagact gctgcaatgt 2820 
gcccaatgga agtataccaa 2880 
tcggcagaat agattttttt 2940 
tctataccta acgccttacc 3000 
ccaataccga gcgcatagct 3060 
taatcccggt accgtttctc 3120 
attttttccc cagcagcttg 3180 
tcattaggct gctggttaaa 324 0 
ttcagcctga tacagattaa 3300 
ggcggcagta gcgcggtggt 3360 
agcgccgatg gtagtgtggg 3420 
aaaacgaaag gctcagtcga 34 80 
cgctctcctg agtaggacaa 3540 
cggagggtgg cgggcaggac 3600 
catcctgacg gatggccttt 3660 
cattcaaata tgtatccgct 3720 
aaaaggaaga gtatgagtat 3780 
ttttgccttc ctgtttttgc 3840 
cagttgggtg cacgagtggg 3900 
agttttcgcc ccgaagaacg 3960 
gcggtattat cccgtgttga 4 020 
cagaatgact tggttgagta 4 080 
gtaagagaat tatgcagtgc 4140 
ctgacaacga tcggaggacc 4200 
gtaactcgcc ttgatcgttg 4260 
gacaccacga tgcctgtagc 4320 
cttactctag cttcccggca 4380 
ccacttctgc gctcggccct 4440 
gagcgtgggt ctcgcggtat 4500 
gtagttatct acacgacggg 4 560 
gagataggtg cctcactgat 4 620 
ctttagattg atttaaaact 4 680 
gataatctca tgaccaaaat 4740 
gtagaaaaga tcaaaggatc 4800 
caaacaaaaa aaccaccgct 4860 
ctttttccga aggtaactgg 4 920 
tagccgtagt taggccacca 4 980 
ctaatcctgt taccagtggc 5040 
tcaagacgat agttaccgga 5100 
cagcccagct tggagcgaac 5160 
gaaagcgcca cgcttcccga 5220 
ggaacaggag agcgcacgag 5280 
gtcgggtttc gccacctctg 5340 
agcctatgga aaaacgccag 5400 
tttgctcaca tgttctttcc 5460 
tttgagtgag ctgataccgc 5520 
gaggaagcgg aagagcgcct 5580 
caccgcatat ggtgcactct 5640 
acactccgct atcgctacgt 5700 
ctgacgcgcc ctgacgggct 57 60 
tctccgggag ctgcatgtgt 5820 
agatcaattc gcgcgcgaag 5880 
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gcgaagcggc 
tgctactccg 
acatcattca 
tttttaaata 
gcgataggca 
cgccagctta 
gacaagcaaa 
ctgatgtact 
atcgcttcca 
tagcgccctt 
tggtgcgctt 
cattcatgcc 
tccggatgac 
cggcaaacaa 
agaatataac 
tcaatcggcg 
tcattttgcg 



atgcataatg 
tcaagccgtc 
ctttttcttc 
cccgcgagaa 
tccgggtggt 
agacgctaat 
catgctgtgc 
gacaagcctc 
tgcgccgcag 
ccccttgccc 
catccgggcg 
agtaggcgcg 
gaccgtagtg 
attctcgtcc 
ctttcattcc 
ttaaacccgc 
cttcagccat 



tgcctgtcaa 
aattgtctga 
acaaccggca 
atagagttga 
gctcaaaagc 
ccctaactgc 
gacgctggcg 
gcgtacccga 
taacaattgc 
ggcgttaatg 
aaagaacccc 
cggacgaaag 
atgaatctct 
ctgatttttc 
cagcggtcgg 
caccagatgg 
acttttcata 



atggacgaag 
ttcgttacca 
cggaactcgc 
tcgtcaaaac 
agcttcgcct 
tggcggaaaa 
atatcaaaat 
ttatccatcg 
tcaagcagat 
atttgcccaa 
gtattggcaa 
taaacccact 
cctggcggga 
accaccccct 
tcgataaaaa 
gcattaaacg 
ctcccgccat 



cagggattct 
attatgacaa 
tcgggctggc 
caacattgcg 
ggctgatacg 
gatgtgacag 
tgctgtctgc 
gtggatggag 
ttatcgccag 
acaggtcgct 
atattgacgg 
ggtgatacca 
acagcaaaat 
gaccgcgaat 
aatcgagata 
agtatcccgg 
tcagag 



gcaaacccta 
cttgacggct 
cccggtgcat 
accgacggtg 
ttggtcctcg 
acgcgacggc 
caggtgatcg 
cgactcgtta 
cagctccgaa 
gaaatgcggc 
ccagttaagc 
ttcgcgagcc 
atcacccggt 
ggtgagattg 
accgttggcc 
cagcagggga 



5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6886 



<210> 21 
<211> 7140 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: 



vector 



<220> 

<223> pAN620 



<400> 21 

tcggcggccg 

caaaagtggc 

ttttgtttcc 

tcctaacagc 

attcctgaaa 

ttaacaacag 

atatttcaca 

ttgttccgac 

aagaaaacga 

aagattttga 

gagtcgatat 

cgattcatgt 

acggggtcgc 

tcggtttaaa 

tggatattga 

ctcgcaatgt 

ttcaaacaag 

ctgcaaaaga 

atccggaact 

ctttttcaaa 

attaatatgt 

aacctgaact 

cttcctaatg 

attattcctg 

gtgcaggaag 

gcaagccatg 

gggaacgaac 

tttcagcctg 

tggcggcagt 

tagcgccgat 



cttcgtcgac 
tgtgactgta 
ttttaaccaa 
acaagagcgg 
aattttgcaa 
caggacgctc 
gctgaaagaa 
gatggggttt 
cgccgttatt 
agcatatccg 
tctttttacg 
agaaagacgc 
gatcgtactg 
agatgcgcag 
attggttcct 
atacttaaca 
tgcggaactt 
tatcattgaa 
cgagcctgtg 
agcgcgttta 
atcgtacgat 
atgtgggaag 
aaaaagtaca 
gtaaacgggg 
gagataaggt 
agccgaaagt 
cagcccgtac 
atacagatta 
agcgcggtgg 
ggtagtgtgg 



cgaaacagca 
aaaagaaatc 
ctgccataac 
aaagatgttt 
aaagttgttg 
tagattagaa 
gccataaaac 
ctgcatgagg 
atgagtattt 
cgcgatattg 
ccagatgctc 
acagacgtgt 
acgaagcttt 
caggtagctg 
gtcgatacgg 
gctgaggaaa 
gtccaagccg 
acgactagcg 
aatgaaattg 
atagataata 
gatgagcggc 
cattacaatt 
aattgtgaat 
aagcggcgtc 
cattattatt 
ggctgttctg 
aattttgtaa 
aatcagaacg 
tcccacctga 
ggtctcccca 



gttataaggc 
gaaaaagacc 
tcgaggccta 
tgttctacat 
actttatcta 
aggaggattt 
aataccattc 
ggcatttaac 
ttgtgaatcc 
agcgggatgc 
atgatatgta 
tatgcgggcg 
tcaatctagt 
ttgttgatgg 
tcagagagga 
gaaaagaagc 
gtgaaagaga 
gaaccataga 
ctggaaagat 
tcattattga 
aaacttcaca 
gatgaagatc 
aataataatg 
atatgcttaa 
tcctacaaaa 
aatgatcaaa 
aggatcctgt 
cagaagcggt 
ccccatgccg 
tgcgagagta 



atgaagctgt 
gttttgtgtg 
cctagcttcc 
ccagaacaac 
caaggtgtgg 
aaatatgaga 
agagggcaag 
cttagcagac 
tgcacaattc 
agctcttgca 
tcccggtgaa 
ctcaagagaa 
caagccgact 
gttaatcagc 
agacggctta 
gcctaagctg 
tcctgaagcg 
ctatgtagag 
gattctcgct 
tattcgtaga 
gggcaactgt 
tcattgatgc 
gagcacgtct 
acggtgcagc 
tgatgtctga 
acaaaattga 
tttggcggat 
ctgataaaac 
aactcagaag 
gggaactgcc 



ccggtttttg 
aaaacggtct 
aagaaagata 
ctctgctaaa 
tataataatc 
cagattactg 
tcaatcggat 
aaagcaagac 
ggccctaatg 
gaaaacgccg 
aagaatgtca 
ggacattttg 
cgtgcctatt 
gacttcttca 
gccaaaagct 
tatcgggccc 
gtgataaaag 
ctttattcct 
gttgcagttg 
aaggaggtga 
taoggaagca 
tgtgggaatg 
tgaaacgtat 
cgcacgcctt 
tcaagaagcg 
acaaatgctg 
gagagaagat 
agaatttgcc 
tgaaacgccg 
aggcatcaaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 
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taaaacgaaa ggctcagtcg aaagactggg cctttcgttt tatctgttgt ttgtcggtga 1860 
acgctctcct gagtaggaca aatccgccgg gagcggattt gaacgttgcg aagcaacggc 1920 
ccggagggtg gcgggcagga cgcccgccat aaactgccag gcatcaaatt aagcagaagg 1980 
ccatcctgac ggatggcctt tttgcgtttc tacaaactct tggtaccgag acgatcgtcc 2040 
tctttgttgt agcccatcac ttttgctgaa gagtaggagc cgaaagtgac ggcgtattca 2100 
ttgagcggca gctgagtcgc accgacagaa atcgcttctc ttgatgtgcc cggcgatccg 2160 
actgtccagc cgttcggtcc gctgttgccg tttgaggtaa cagcgacaac gccttctgac 2220 
atggcccagt caagcgctgt gcttgtcgcc cagtccgggt tgtttaaaga gtttccgaga 2280 
gacaggttca tcacatctgc cccgtcctgc actgcacgtt ccacgcccgc gatgacgttt 2340 
tccgttgtgc cgcttccgcc aggccctaac acacgataag caagaagtgt ggcatcaggc 24 00 
gctacgcctt taatcgttcc gtttgcagcc acagttccgg ctacgtgtgt gccatggtca 24 60 
gttgcctcgc ccctcggatc gccggttggt gtttcttttg gatcgtaatc attgtccaca 2520 
aaatcgtatc ctttatattg tccaaagttt ttcttcagat ctgggtgatt gtattcaacc 2580 
ccagtgtcaa taatcgccac cttgatgcct tttcctgtgt agcctaaatc ccatgcatcg 2640 
tttgctccga tataaggcgc actgtcatcc atttgcggag atacggcgtc ttcggagatt 2700 
gtggggaatt ctcatgtttg acagcttatc atgcaatagt tacccttatt atcaagataa 2760 
gaaagaaaag gatttttcgc tacgctcaaa tcctttaaaa aaacacaaaa gaccacattt 2820 
tttaatgtgg tctttattct tcaactaaag cacccattag ttcaacaaac gaaaattgga 2880 
taaagtggga tatttttaaa atatatattt atgttacagt aatattgact tttaaaaaag 2940 
gattgattct aatgaagaaa gcagacaagt aagcctccta aattcacttt agataaaaat 3000 
ttaggaggca tatcaaatga actttaataa aattgattta gacaattgga agagaaaaga 3060 
gatatttaat cattatttga accaacaaac gacttttagt ataaccacag aaattgatat 3120 
tagtgtttta taccgaaaca taaaacaaga aggatataaa ttttaccctg catttatttt 3180 
cttagtgaca agggtgataa actcaaatac agcttttaga actggttaca atagcgacgg 3240 
agagttaggt tattgggata agttagagcc actttataca atttttgatg gtgtatctaa 3300 
aacattctct ggtatttgga ctcctgtaaa gaatgacttc aaagagtttt atgatttata 3360 
cctttctgat gtagagaaat ataatggttc ggggaaattg tttcccaaaa cacctatacc 3420 
tgaaaatgct ttttctcttt ctattattcc atggacttca tttactgggt ttaacttaaa 34 80 
tatcaataat aatagtaatt accttctacc cattattac'a gcaggaaaat tcattaataa 3540 
aggtaattca atatatttac cgctatcttt acaggtacat cattctgttt gtgatggtta 3600 
tcatgcagga ttgtttatga actctattca ggaattgtca gataggccta atgactggct 3660 
tttataatat gagataatgc cgactgtact ttttacagtc ggttttctaa tgtcactaac 3720 
ctgccccgtt agttgaagaa cgaagcggcc gcaattcttg aagacgaaag ggcctcgtga 3780 
tacgcctatt tttataggtt aatgtcatga taataatggt ttcttagacg tcaggtggca 3840 
ctthtcgggg aaatgtgcgc ggaaccccta tttgtttatt tttctaaata cattcaaata 3900 
tgtatccgct catgagacaa taaccctgat aaatgcttca ataatattga aaaaggaaga 3960 . 
gtatgagtat tcaacatttc cgtgtcgccc ttattccctt ttttgcggca ttttgccttc 4020 • 
ctgtttttgc tcacccagaa acgctggtga aagtaaaaga tgctgaagat cagttgggtg 4080 
cacgagtggg ttacatcgaa ctggatctca acagcggtaa gatccttgag agttttcgcc 4140 
ccgaagaacg ttttccaatg atgagcactt ttaaagttct gctatgtggc gcggtattat 4200 
cccgtattga cgccgggcaa gagcaactcg gtcgccgcat acactattct cagaatgact 4260 
tggttgagta ctcaccagtc acagaaaagc atcttacgga tggcatgaca gtaagagaat 4 320 
tatgcagtgc tgccataacc atgagtgata acactgcggc caacttactt ctgacaacga 4 380 
tcggaggacc gaaggagcta accgcttttt tgcacaacat gggggatcat gtaactcgcc 4 4 40 
ttgatcgttg ggaaccggag ctgaatgaag ccataccaaa cgacgagcgt gacaccacga 4 500 
tgcctgcagc aatggcaaca acgttgcgca aactattaac tggcgaacta cttactctag 4 560 
cttcccggca acaattaata gactggatgg aggcggataa agttgcagga ccacttctgc 4 620 
gctcggccct tccggctggc tggtttattg ctgataaatc tggagccggt gagcgtgggt 4 680 
ctcgcggtat cattgcagca ctggggccag atggtaagcc ctcccgtatc gtagttatct 4 740 
acacgacggg gagtcaggca actatggatg aacgaaatag acagatcgct gagataggtg 4 800 
cctcactgat taagcattgg taactgtcag accaagttta ctcatatata ctttagattg 4 860 
atttaaaact tcatttttaa tttaaaagga tctaggtgaa gatccttttt gataatctca 4 920 
tgaccaaaat cccttaacgt gagttttcgt tccactgagc gtcagacccc gtagaaaaga 4 980 
tcaaaggatc ttcttgagat cctttttttc tgcgcgtaat ctgctgcttg caaacaaaaa 5040 
aaccaccgct accagcggtg gtttgtttgc cggatcaaga gctaccaact ctttttccga 5100 
aggtaactgg cttcagcaga gcgcagatac caaatactgt ccttctagtg tagccgtagt 5160 
taggccacca cttcaagaac tctgtagcac cgcctacata cctcgctctg ctaatcctgt 5220 
taccagtggc tgctgccagt ggcgataagt cgtgtcttac cgggttggac tcaagacgat 5280 
agttaccgga taaggcgcag cggtcgggct gaacgggggg ttcgtgcaca cagcccagct 5340 
tggagcgaac gacctacacc gaactgagat acctacagcg tgagctatga gaaagcgcca 5400 
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cgcttcccga agggagaaag gcggacaggt atccggtaag cggcagggtc ggaacaggag 54 60 
agcgcacgag ggagcttcca gggggaaacg cctggtatct ttatagtcct gtcgggtttc 5520 
gccacctctg acttgagcgt cgatttttgt gatgctcgtc aggggggcgg agcctatgga 5580 
aaaacgccag caacgcggcc tttttacggt tcctggcctt ttgctggcct tttgctcaca 5640 
tgttctttcc tgcgttatcc cctgattctg tggataaccg tattaccgcc tttgagtgag 5700 
ctgataccgc tcgccgcagc cgaacgaccg agcgcagcga gtcagtgagc gaggaagcgg 5760 
aagagcgcct gatgcggtat tttctcctta cgcatctgtg cggtatttca caccgcatat 5820 
ggtgcactct cagtacaatc tgctctgatg ccgcatagtt aagccagtat acactccgct 5880 
atcgctacgt gactgggtca tggctgcgcc ccgacacccg ccaacacccg ctgacgcgcc 5940 
ctgacgggct tgtctgctcc cggcatccgc ttacagacaa gctgtgaccg tctccgggag 6000 
ctgcatgtgt cagaggtttt caccgtcatc accgaaacgc gcgaggcagc tgcggtaaag 6060 
ctcatcagcg tggtcgtgaa gcgattcaca gatgtctgcc tgttcatccg cgtccagctc 6120 
gttgagtttc tccagaagcg ttaatgtctg gcttctgata aagcgggcca tgttaagggc 6180 
ggttttttcc tgtttggtca cttgatgcct ccgtgtaagg gggaatttct gttcatgggg 6240 
gtaatgatac cgatgaaacg agagaggatg ctcacgatac gggttactga tgatgaacat 6300 
gcccggttac tggaacgttg tgagggtaaa caactggcgg tatggatgcg gcgggaccag 6360 
agaaaaatca ctcagggtca atgccagcgc ttcgttaata cagatgtagg tgttccacag 6420 
ggtagccagc agcatcctgc gatgcagatc cggaacataa tggtgcaggg cgctgacttc 6480 
cgcgtttcca gactttacga aacacggaaa ccgaagacca ttcatgttgt tgctcaggtc 6540 
gcagacgttt tgcagcagca gtcgcttcac gttcgctcgc gtatcggtga ttcattctgc 6600 
taaccagtaa ggcaaccccg ccagcctagc cgggtcctca acgacaggag cacgatcatg 6660 
cgcacccgtg gccaggaccc aacgctgccc gagatgcgcc gcgtgcggct gctggagatg 6720 
gcggacgcga tggatatgtt ctgccaaggg ttggtttgcg cattcacagt tctccgcaag 6780 
aattgattgg ctccaattct tggagtggtg aatccgttag cgaggtgccg ccggcttcca 684 0 
ttcaggtcga ggtggcccgg ctccatgcac cgcgacgcaa cgcggggagg cagacaaggt 6900 
atagggcggc gcctacaatc catgccaacc cgttccatgt gctcgccgag gcggcataaa 6960 
tcgccgtgac gatcagcggt ccagtgatcg aagttaggct ggtaagagcc gcgagcgatc 7020 
cttgaagctg tccctgatgg tcgtcatcta cctgcctgga cagcatggcc tgcaacgcgg 7080 
gcatcccgat gccgccggaa gcgagaagaa tcataatggg gaaggccatc cagcctcgcg 714 0 

<210> 22 
<211> 6725 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: vector 
<220> 

<223> pAN636 
<400> 22 

tcggcggccg cttcgtcgac cgaaacagca gttataaggc atgaagctgt ccggtttttg 60 

caaaagtggc tgtgactgta aaaagaaatc gaaaaagacc gttttgtgtg aaaacggtct 120 

ttttgtttcc ttttaaccaa ctgccataac tcgaggccta cctagcttcc aagaaagata 180 

tcctaacagc acaagagcgg aaagatgttt tgttctacat ccagaacaac ctctgctaaa 240 

attcctgaaa aattttgcaa aaagttgttg actttatcta caaggtgtgg tataataatc 300 

ttaacaacag caggacgctc tagaaggagg agaaaacatg aaaacaaaac tggattttct 360 

aaaaatgaag gagtctgaag aaccgattgt catgctgacc gcttatgatt atccggcagc 420 

taaacttgct gaacaagcgg gagttgacat gattttagtc ggtgattcac ttggaatggt 480 

cgtcctcggc cttgattcaa ctgtcggtgt gacagttgcg gacatgatcc atcatacaaa 540 

agccgttaaa aggggtgcgc cgaatacctt tattgtgaca gatatgccgt ttatgtctta 600 

tcacctgtct aaggaagata cgctgaaaaa tgcagcggct atcgttcagg aaagcggagc 660 

tgacgcactg aagcttgagg gcggagaagg cgtgtttgaa tccattcgcg cattgacgct 720 

tggaggcatt ccagtagtca gtcacttagg tttgacaccg cagtcagtcg gcgtactggg 780 

cggctataaa gtacagggca aagacgaaca aagcgccaaa aaattaatag aagacagtat 840 

aaaatgcgaa gaagcaggag ctatgatgct tgtgctggaa tgtgtgccgg cagaactcac 900 

agccaaaatt gccgagacgc taagcatacc ggtcattgga atcggggctg gtgtgaaagc 960 

ggacggacaa gttctcgttt atcatgatat tatcggccac ggtgttgaga gaacacctaa 1020 

atttgtaaag caatatacgc gcattgatga aaccatcgaa acagcaatca gcggatatgt 1080 
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tcaggatgta 
agtgcttgac 
aagattttca 
ttgcctggcg 
cgccgtagcg 
tcaaataaaa 
ggtgaacgct 
acggcccgga 
gaaggccatc 
cgtcctcttt 
attcattgag 
atccgactgt 
ctgacatggc 
cgagagacag 
cgttttccgt 
caggcgctac 
ggtcagttgc 
ccacaaaatc 
caaccccagt 
catcgtttgc 
agattgtggg 
gataagaaag 
cattttttaa 
ttggataaag 
aaaaggattg 
aaaatttagg 
aaagagatat 
gatattagtg 
attttcttag 
gacggagagt 
tctaaaacat 
ttataccttt 
atacctgaaa 
ttaaatatca 
aataaaggta 
ggttatcatg 
tggcttttat 
ctaacctgcc 
cgtgatacgc 
tggcactttt 
aaatatgtat 
gaagagtatg 
ccttcctgtt 
gggtgcacga 
tcgccccgaa 
attatcccgt 
tgacttggtt 
agaattatgc 
aacgatcgga 
tcgccttgat 
cacgatgcct 
tctagcttcc 
tctgcgctcg 
tgggtctcgc 
tatctacacg 
aggtgcctca 
gattgattta 
tctcatgacc 
aaagatcaaa 
aaaaaaacca 



agacatcgtg 
ggcttgtacg 
gcctgataca 
gcagtagcgc 
ccgatggtag 
cgaaaggctc 
ctcctgagta 
gggtggcggg 
ctgacggatg 
gttgtagccc 
cggcagctga 
ccagccgttc 
ccagtcaagc 
gttcatcaca 
tgtgccgctt 
gcctttaatc 
ctcgcccctc 
gtatccttta 
gtcaataatc 
tccgatataa 
gaattctcat 
aaaaggattt 
tgtggtcttt 
tgggatattt 
attctaatga 
aggcatatca 
ttaatcatta 
ttttataccg 
tgacaagggt 
taggttattg 
tctctggtat 
ctgatgtaga 
atgctttttc 
ataataatag 
attcaatata 
caggattgtt 
aatatgagat 
ccgttagttg 
ctatttttat 
cggggaaatg 
ccgctcatga 
agtattcaac 
tttgctcacc 
gtgggttaca 
gaacgttttc 
attgacgccg 
gagtactcac 
agtgctgcca 
ggaccgaagg 
cgttgggaac 
gcagcaatgg 
cggcaacaat 
gcccttccgg 
ggtatcattg 
acggggagtc 
ctgattaagc 
aaacttcatt 
aaaatccctt 
ggatcttctt 
ccgctaccag 



ctttccctga 
ggggaaaata 
gattaaatca 
ggtggtccca 
tgtggggtct 
agtcgaaaga 
ggacaaatcc 
caggacgccc 
gcctttttgc 
atcacttttg 
gtcgcaccga 
ggtccgctgt 
gctgtgcttg 
tctgccccgt 
ccgccaggcc 
gttccgtttg 
ggatcgccgg 
tattgtccaa 
gccaccttga 
ggcgcactgt 
gtttgacagc 
ttcgctacgc 
attcttcaac 
ttaaaatata 
agaaagcaga 
aatgaacttt 
tttgaaccaa 
aaacataaaa 
gataaactca 
ggataagtta 
ttggactcct 
gaaatataat 
tctttctatt 
taattacctt 
tttaccgcta 
tatgaactct 
aatgccgact 
aagaacgaag 
aggttaatgt 
tgcgcggaac 
gacaataacc 
atttccgtgt 
cagaaacgct 
tcgaactgga 
caatgatgag 
ggcaagagca 
cagtcacaga 
taaccatgag 
'agctaaccgc 
cggagctgaa 
caacaacgtt 
taatagactg 
ctggctggtt 
cagcactggg 
aggcaactat 
attggtaact 
tttaatttaa 
aacgtgagtt 
gagatccttt 
cggtggtttg 



acaaaagcat 
agggggggat 
gaacgcagaa 
cctgacccca 
ccccatgcga 
ctgggccttt 
gccgggagcg 
gccataaact 
gtttctacaa 
ctgaagagta 
cagaaatcgc 
tgccgtttga 
tcgcccagtc 
cctgcactgc 
ctaacacacg 
cagccacagt 
ttggtgtttc 
agtttttctt 
tgccttttcc 
catccatttg 
ttatcatgca 
tcaaatcctt 
taaagcaccc 
tatttatgtt 
caagtaagcc 
aataaaattg 
caaacgactt 
caagaaggat 
aatacagctt 
gagccacttt 
gtaaagaatg 
ggttcgggga 
attccatgga 
ctacccatta 
tctttacagg 
attcaggaat 
gtacttttta 
cggccgcaat 
catgataata 
ccctatttgt 
ctgataaatg 
cgcccttatt 
ggtgaaagta 
tctcaacagc 
cacttttaaa 
actcggtcgc 
aaagcatctt 
tgataacact 
ttttttgcac 
tgaagccata 
gcgcaaacta 
gatggaggcg 
tattgctgat 
gccagatggt 
ggatgaacga 
gtcagaccaa 
aaggatctag 
ttcgttccac 
ttttctgcgc 
tttgccggat 



tcctttcaaa 
cctgttttgg 
gcggtctgat 
tgccgaactc 
gagtagggaa 
cgttttatct 
gatttgaacg 
gccaggcatc 
actcttggta 
ggagccgaaa 
ttctcttgat 
ggtaacagcg 
cgggttgttt 
acgttccacg 
ataagcaaga 
tccggctacg 
ttttggatcg 
cagatctggg 
tgtgtagcct 
cggagatacg 
atagttaccc 
taaaaaaaca 
attagttcaa 
acagtaatat 
tcctaaattc 
atttagacaa 
ttagtataac 
ataaatttta 
ttagaactgg 
atacaatttt 
acttcaaaga 
aattgtttcc 
cttcatttac 
ttacagcagg 
tacatcattc 
tgtcagatag 
cagtcggttt 
tcttgaagac 
atggtttctt 
ttatttttct 
cttcaataat 
cccttttttg 
aaagatgctg 
ggtaagatcc 
gttctgctat 
cgcatacact 
acggatggca 
gcggccaact 
aacatggggg 
ccaaacgacg 
ttaactggcg 
gataaagttg 
aaatctggag 
aagccctccc 
aatagacaga 
gtttactcat 
gtgaagatcc 
tgagcgtcag 
gtaatctgct 
caagagctac 



tgaaccagac 
Cggatgagag 
aaaacagaat 
agaagtgaaa 
ctgccaggca 
gttgtttgtc 
ttgcgaagca 
aaattaagca 
ccgagacgat 
gtgacggcgt 
gtgcccggcg 
acaacgcctt 
aaagagtttc 
cccgcgatga 
agtgtggcat 
tgtgtgccat 
taatcattgt 
tgattgtatt 
aaatcccatg 
gcgtcttcgg 
ttattatcaa 
caaaagacca 
caaacgaaaa 
tgacttttaa 
actttagata 
ttggaagaga 
cacagaaatt 
ccctgcattt 
ttacaatagc 
tgatggtgta 
gttttatgat 
caaaacacct 
tgggtttaac 
aaaattcatt 
tgtttgtgat 
gcctaatgac 
tctaatgtca 
gaaagggcct 
agacgtcagg 
aaatacattc 
attgaaaaag 
cggcattttg 
aagatcagtt 
ttgagagttt 
gtggcgcggt 
attctcagaa 
tgacagtaag 
tacttctgac 
atcatgtaac 
agcgtgacac 
aactacttac 
caggaccact 
ccggtgagcg 
gtatcgtagt 
tcgctgagat 
atatacttta 
tttttgataa 
accccgtaga 
gcttgcaaac 
caactctttt 



1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
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tccgaaggta 
gtagttaggc 
cctgttacca 
acgatagtta 
cagcttggag 
cgccacgctt 
aggagagcgc 
gtttcgccac 
atggaaaaac 
tcacatgttc 
gtgagctgat 
agcggaagag 
catatggtgc 
ccgctatcgc 
gcgccctgac 
gggagctgca 
taaagctcat 
agctcgttga 
agggcggttt 
tgggggtaat 
aacatgcccg 
accagagaaa 
cacagggtag 
acttccgcgt 
aggtcgcaga 
tctgctaacc 
tcatgcgcac 
agatggcgga 
gcaagaattg 
ttccattcag 
aaggtatagg 
ataaatcgcc 
cgatccttga 
cgcgggcatc 
tcgcg 



actggcttca 
caccacttca 
gtggctgctg 
ccggataagg 
cgaacgacct 
cccgaaggga 
acgagggagc 
ctctgacttg 
gccagcaacg 
tttcctgcgt 
accgctcgcc 
cgcctgatgc 
actctcagta 
tacgtgactg 
gggcttgtct 
tgtgtcagag 
cagcgtggtc 
gtttctccag 
tttcctgttt 
gataccgatg 
gttactggaa 
aatcactcag 
ccagcagcat 
ttccagactt 
cgttttgcag 
agtaaggcaa 
ccgtggccag 
cgcgatggat 
attggctcca 
gtcgaggtgg 
gcggcgccta 
gtgacgatca 
agctgtccct 
ccgatgccgc 



gcagagcgca 
agaactctgt 
ccagtggcga 
cgcagcggtc 
acaccgaact 
gaaaggcgga 
ttccaggggg 
agcgtcgatt 
cggccttttt 
tatcccctga 
gcagccgaac 
ggtattttct 
caatctgctc 
ggtcatggct 
gctcccggca 
gttttcaccg 
gtgaagcgat 
aagcgttaat 
ggtcacttga 
aaacgagaga 
cgttgtgagg 
ggtcaatgcc 
cctgcgatgc 
tacgaaacac 
cagcagtcgc 
ccccgccagc 
gacccaacgc 
atgttctgcc 
attcttggag 
cccggctcca 
caatccatgc 
gcggtccagt 
gatggtcgtc 
cggaagcgag 



gataccaaat 
agcaccgcct 
taagtcgtgt 
gggctgaacg 
gagataccta 
caggtatccg 
aaacgcctgg 
tttgtgatgc 
acggttcctg 
ttctgtggat 
gaccgagcgc 
ccttacgcat 
tgatgccgca 
gcgccccgac 
tccgcttaca 
tcatcaccga 
tcacagatgt 
gtctggcttc 
tgcctccgtg 
ggatgctcac 
gtaaacaact 
agcgcttcgt 
agatccggaa 
ggaaaccgaa 
ttcacgttcg 
ctagccgggt 
tgcccgagat 
aagggttggt 
tggtgaatcc 
tgcaccgcga 
caacccgttc 
gatcgaagtt 
atctacctgc 
aagaatcata 



actgtccttc 
acatacctcg 
cttaccgggt 
gggggttcgt 
cagcgtgagc 
gtaagcggca 
tatctttata 
tcgtcagggg 
gccttttgct 
aaccgtatta 
agcgagtcag 
ctgtgcggta 
tagttaagcc 
acccgccaac 
gacaagctgt 
aacgcgcgag 
ctgcctgttc 
tgataaagcg 
taagggggaa 
gatacgggtt 
ggcggtatgg 
taatacagat 
cataatggtg 
gaccattcat 
ctcgcgtatc 
cctcaacgac 
gcgccgcgtg 
ttgcgcattc 
gttagcgagg 
cgcaacgcgg 
catgtgctcg 
aggctggtaa 
ctggacagca 
atggggaagg 



tagtgtagcc 
ctctgctaat 
tggactcaag 
gcacacagcc 
tatgagaaag 
gggtcggaac 
gtcctgtcgg 
ggcggagcct 
ggccttttgc 
ccgcctttga 
tgagcgagga 
tttcacaccg 
agtatacact 
acccgctgac 
gaccgtctcc 
gcagctgcgg 
atccgcgtcc 
ggccatgtta 
tttctgttca 
actgatgatg 
atgcggcggg 
gtaggtgttc 
cagggcgctg 
gttgttgctc 
ggtgattcat 
aggagcacga 
cggctgctgg 
acagttctcc 
tgccgccggc 
ggaggcagac 
ccgaggcggc 
gagccgcgag 
tggcctgcaa 
ccatccagcc 



4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6725 



<210> 23 
<211> 6806 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: vector 
<220> 

<223> pAN637 



<400> 23 

tcggcggccg 

caaaagtggc 

ttttgtttcc 

tcctaacagc 

attcctgaaa 

ttaacaacag 

aagattggaa 

tatcacgacg 

ggaatccggc 

atcaattcgg 

ttttcgtcgc 

catatccacg 



cttcgtcgac 
tgtgactgta 
ttttaaccaa 
acaagagcgg 
aattttgcaa 
caggacgctc 
tcattggcgg 
tgactgttgt 
tttataaagg 
actttgacct 
ttgaacgaat 
acctaaaaga 



cgaaacagca 
aaaagaaatc 
ctgccataac 
aaagatgttt 
aaagttgttg 
tagacaattg 
aggctccgtt 
gacgaggcgg 
cggggaggaa 
gcttgtcgtg 
cgggaagacg 
ctggcacgtt 



gttataaggc 
gaaaaagacc 
tcgaggccta 
tgttctacat 
actttatcta 
agatcttaag 
ggtcttttat 
caagaacagg 
ttcagggctg 
acagtgaagc 
aatatattat 
ggccattcca 



atgaagctgt 
gttttgtgtg 
cctagcttcc 
ccagaacaac 
caaggtgtgg 
aaaggaggtg 
gcgcctatta 
ctgcggccat 
attgcagtgc 
agcatcagct 
ttttgcaaaa 
tttatgttgg 



ccggtttttg 60 
aaaacggtct 120 
aagaaagata 180 
ctctgctaaa 240 
tataataatc 300 
ttaattaatg 360 
tttgtcactt 420 
tcagtctgaa 480 
ggacacgagt 540 
tcaatctgtt 600 
cggcatgggg 660 
aatcgttgag 720 
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cacggagctg taagaaaatc ggatacagct 
tggagcgcgt tcgacgatgc tgaaccagac 
tcggattttc cgatttatta tgagacggat 
gtaaatgcgt gtattaatcc tttaactgcg 
acaacgccag cttatctggc ttttatgaag 
aaacttgaaa atgaagaaaa ggcttgggag 
gagaatcgtt catcaatgct ggttgacgtc 
attatcggat acttattgaa ggaagcaagt 
tttttatatg gcagcatcaa agcattggag 
tcctgttttg gcggatgaga gaagattttc 
agcggtctga taaaacagaa tttgcctggc 
atgccgaact cagaagtgaa acgccgtagc 
agagtaggga actgccaggc atcaaataaa 
tcgttttatc tgttgtttgt cggtgaacgc 
ggatttgaac gttgcgaagc aacggcccgg 
tgccaggcat caaattaagc agaaggccat 
aactcttggt accgagacga tcgtcctctt 
aggagccgaa agtgacggcg tattcattga 
cttctcttga tgtgcccggc gatccgactg 
aggtaacagc gacaacgcct tctgacatgg 
ccgggttgtt taaagagttt ccgagagaca 
cacgttccac gcccgcgatg acgttttccg 
gataagcaag aagtgtggca tcaggcgcta 
ttccggctac gtgtgtgcca tggtcagttg 
cttttggatc gtaatcattg tccacaaaat 
tcagatctgg gtgattgtat tcaaccccag 
ctgtgtagcc taaatcccat gcatcgtttg 
gcggagatac ggcgtcttcg gagattgtgg 
aatagttacc cttattatca agataagaaa 
ttaaaaaaac acaaaagacc acatttttta 
cattagttca acaaacgaaa attggataaa 
tacagtaata ttgactttta aaaaaggatt 
ctcctaaatt cactttagat aaaaatttag 
gatttagaca attggaagag aaaagagata 
tttagtataa ccacagaaat tgatattagt 
tataaatttt accctgcatt tattttctta 
tttagaactg gttacaatag cgacggagag 
tatacaattt ttgatggtgt atctaaaaca 
gacttcaaag agttttatga tttatacctt 
aaattgtttc ccaaaacacc tatacctgaa 
acttcattta ctgggtttaa cttaaatatc 
attacagcag gaaaattcat taataaaggt 
gtacatcatt ctgtttgtga tggttatcat 
ttgtcagata ggcctaatga ctggctttta 
acagtcggtt ttctaatgtc actaacctgc 
ttcttgaaga cgaaagggcc tcgtgatacg 
aatggtttct tagacgtcag gtggcacttt 
tttatttttc taaatacatt caaatatgta 
gcttcaataa tattgaaaaa ggaagagtat 
tccctttttt gcggcatttt gccttcctgt 
aaaagatgct gaagatcagt tgggtgcacg 
cggtaagatc cttgagagtt ttcgccccga 
agttctgcta tgtggcgcgg tattatcccg 
ccgcatacac tattctcaga atgacttggt 
tacggatggc atgacagtaa gagaattatg 
tgcggccaac ttacttctga caacgatcgg 
caacatgggg gatcatgtaa ctcgccttga 
accaaacgac gagcgtgaca ccacgatgcc 
attaactggc gaactactta ctctagcttc 
ggataaagtt gcaggaccac ttctgcgcto 



gttgatcata caggcctagg tgcgataaaa 780 
cggctgaaca tcttgtttca gcataaccat 840 
tggtaccgtc tgctgacggg caagctgatt 900 
ttattgcaag tgaaaaatgg agaactgctg 960 
ctggtatttc aggaggcatg ccgcatttta 1020 
cgggttcagg ccgtttgtgg gcaaacgaaa 1080 
attggaggcc ggcagacgga agctgacgcc 1140 
cttcaaggtc ttgatgccgt ccacctagag 1200 
cgaaatacca acaaagtggt ttactaagga 1260 
agcctgatac agattaaatc agaacgcaga 1320 
ggcagtagcg cggtggtccc acctgacccc 1380 
gccgatggta gtgtggggtc tccccatgcg 14 40 
acgaaaggct cagtcgaaag actgggcctt 1500 
tctcctgagt aggacaaatc cgccgggagc 1560 
agggtggcgg gcaggacgcc cgccataaac 1620 
cctgacggat ggcctttttg cgtttctaca 1680 
tgttgtagcc catcactttt gctgaagagt 1740 
gcggcagctg agtcgcaccg acagaaatcg 1800 
tccagccgtt cggtccgctg ttgccgtttg 1860 
cccagtcaag cgctgtgctt gtcgcccagt 1920 
ggttcatcac atctgccccg tcctgcactg 1980 
ttgtgccgct tccgccaggc cctaacacac 2040 
cgcctttaat cgttccgttt gcagccacag 2100 
cctcgcccct cggatcgccg gttggtgttt 2160 
cgtatccttt atattgtcca aagtttttct 2220 
tgtcaataat cgccaccttg atgccttttc 2280 
ctccgatata aggcgcactg tcatccattt 234 0 
ggaattctca tgtttgacag cttatcatgc 2400 
gaaaaggatt tttcgctacg ctcaaatcct 24 60 
atgtggtctt tattcttcaa ctaaagcacc 2520 
gtgggatatt tttaaaatat atatttatgt 2580 
gattctaatg aagaaagcag acaagtaagc 2640 
gaggcatatc aaatgaactt taataaaatt 2700 
tttaatcatt atttgaacca acaaacgact 2760 
gttttatacc gaaacataaa acaagaagga 2820 
gtgacaaggg tgataaactc aaatacagct 2880 
ttaggttatt gggataagtt agagccactt 2940 
ttctctggta tttggactcc tgtaaagaat 3000 
tctgatgtag agaaatataa tggttcgggg 3060 
aatgcttttt ctctttctat tattccatgg 3120 
aataataata gtaattacct tctacccatt 3180 
aattcaatat atttaccgct atctttacag 3240 
gcaggattgt ttatgaactc tattcaggaa 3300 
taatatgaga taatgccgac tgtacttttt 3360 
cccgttagtt gaagaacgaa gcggccgcaa 3420 
cctattttta taggttaatg tcatgataat 3480 
tcggggaaat gtgcgcggaa cccctatttg 354 0 
tccgctcatg agacaataac cctgataaat 3600 
gagtattcaa catttccgtg tcgcccttat 3660 
ttttgctcac ccagaaacgc tggtgaaagt 3720 
agtgggttac atcgaactgg atctcaacag 3780 
agaacgtttt ccaatgatga gcacttttaa 3840 
tattgacgcc gggcaagagc aactcggtcg 3900 
tgagtactca ccagtcacag aaaagcatct 3960 
cagtgctgcc ataaccatga gtgataacac 4020 
aggaccgaag gagctaaccg cttttttgca 4080 
tcgttgggaa ccggagctga atgaagccat 4140 
tgcagcaatg gcaacaacgt tgcgcaaact 4200 
ccggcaacaa ttaatagact ggatggaggc 4260 
ggcccttccg gctggctggt ttattgctga 4320 
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taaatctgga 
taagccctcc 
aaatagacag 
agtttactca 
ggtgaagatc 
ctgagcgtca 
cgtaatctgc 
tcaagagcta 
tactgtcctt 
tacatacctc 
tcttaccggg 
ggggggttcg 
acagcgtgag 
ggtaagcggc 
gtatctttat 
ctcgtcaggg 
ggccttttgc 
taaccgtatt 
cagcgagtca 
tctgtgcggt 
atagttaagc 
cacccgccaa 
agacaagctg 
aaacgcgcga 
tctgcctgtt 
ctgataaagc 
gtaaggggga 
cgatacgggt 
tggcggtatg 
ttaatacaga 
acataatggt 
agaccattca 
gctcgcgtat 
tcctcaacga 
tgcgccgcgt 
tttgcgcatt 
cgttagcgag 
acgcaacgcg 
ccatgtgctc 
taggctggta 
cctggacagc 
aatggggaag 



gccggtgagc 
cgtatcgtag 
atcgctgaga 
tatatacttt 
ctttttgata 
gaccccgtag 
tgcttgcaaa 
ccaactcttt 
ctagtgtagc 
gctctgctaa 
ttggactcaa 
tgcacacagc 
ctatgagaaa 
agggtcggaa 
agtcctgtcg 
gggcggagcc 
tggccttttg 
accgcctttg 
gtgagcgagg 
atttcacacc 
cagtatacac 
cacccgctga 
tgaccgtctc 
ggcagctgcg 
catccgcgtc 
gggccatgtt 
atttctgttc 
tactgatgat 
gatgcggcgg 
tgtaggtgtt 
gcagggcgct 
tgttgttgct 
cggtgattca 
caggagcacg 
gcggctgctg 
cacagttctc 
gtgccgccgg 
gggaggcaga 
gccgaggcgg 
agagccgcga 
atggcctgca 
gccatccagc 



gtgggtctcg 
ttatctacac 
taggtgcctc 

agattgattt 
atctcatgac 
aaaagatcaa 
caaaaaaacc 
ttccgaaggt 
cgtagttagg 
tcctgttacc 
gacgatagtt 
ccagcttgga 
gcgccacgct 
caggagagcg 
ggtttcgcca 
tatggaaaaa 
ctcacatgtt 
agtgagctga 
aagcggaaga 
gcatatggtg 
tccgctatcg 
cgcgccctga 
cgggagctgc 
gtaaagctca 
cagctcgttg 
aagggcggtt 
atgggggtaa 
gaacatgccc 
gaccagagaa 
ccacagggta 
gacttccgcg 
caggtcgcag 
ttctgctaac 
atcatgcgca 
gagatggcgg 
cgcaagaatt 
cttccattca 
caaggtatag 
cataaatcgc 
gcgatccttg 
acgcgggcat 
ctcgcg 



cggtatcatt 
gacggggagt 
actgattaag 
aaaacttcat 
caaaatccct 
aggatcttct 
accgctacca 
aactggcttc 
ccaccacttc 
agtggctgct 
accggataag 
gcgaacgacc 
tcccgaaggg 
cacgagggag 
cctctgactt 
cgccagcaac 
ctttcctgcg 
taccgctcgc 
gcgcctgatg 
cactctcagt 
ctacgtgact 
cgggcttgtc 
atgtgtcaga 
tcagcgtggt 
agtttctcca 
ttttcctgtt 
tgataccgat 
ggttactgga 
aaatcactca 
gccagcagca 
tttccagact 
acgttttgca 
cagtaaggca 
cccgtggcca 
acgcgatgga 
gattggctcc 
ggtcgaggtg 
ggcggcgcct 
cgtgacgatc 
aagctgtccc 
cccgatgccg 



gcagcactgg 
caggcaacta 
cattggtaac 
ttttaattta 
taacgtgagt 
tgagatcctt 
gcggtggttt 
agcagagcgc 
aagaactctg 
gccagtggcg 
gcgcagcggt 
tacaccgaac 
agaaaggcgg 
cttccagggg 
gagcgtcgat 
gcggcctttt 
ttatcccctg 
cgcagccgaa 
cggtattttc 
acaatctgct 
gggtcatggc 
tgctcccggc 
ggttttcacc 
cgtgaagcga 
gaagcgttaa 
tggtcacttg 
gaaacgagag 
acgttgtgag 
gggtcaatgc 
tcctgcgatg 
ttacgaaaca 
gcagcagtcg 
accccgccag 
ggacccaacg 
tatgttctgc 
aattcttgga 
gcccggctcc 
acaatccatg 
agcggtccag 
tgatggtcgt 
ccggaagcga 



ggccagatgg 
tggatgaacg 
tgtcagacca 
aaaggatcta 
tttcgttcca 
tttttctgcg 
gtttgccgga 
agataccaaa 
tagcaccgcc 
ataagtcgtg 
cgggctgaac 
tgagatacct 
acaggtatcc 
gaaacgcctg 
ttttgtgatg 
tacggttcct 
attctgtgga 
cgaccgagcg 
tccttacgca 
ctgatgccgc 
tgcgccccga 
atccgcttac 
gtcatcaccg 
ttcacagatg 
tgtctggctt 
atgcctccgt 
aggatgctca 
ggtaaacaac 
cagcgcttcg 
cagatccgga 
cggaaaccga 
cttcacgttc 
cctagccggg 
ctgcccgaga 
caagggttgg 
gtggtgaatc 
atgcaccgcg 
ccaacccgtt 
tgatcgaagt 
catctacctg 
gaagaatcat 



4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6806 



<210> 24 
<211> 3867 
<212> DNA ' 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: vector 



<400> 24 

aagctttctc aagaagcgaa caagaaaaaa 

gtcgctagat tcagcgccaa tgcgtctaaa 

ctcgaaaaaa tcacgctgga tgatattaaa 

ttcacgccgg aacgggaaat cggaaatgat 

atcgatggcg taaaggtgct tgacaatgtc 

gctttcactg gccgaaatga acttgctgtt 

atggaagctg acagcggaac gtttaaatgg 

aaagacaaca gcgaatattt cgaaggcagt 



gaagagcaga ttaaacagct tcaagagttt 60 
tctaagcagg ctacatcaag aaagaaactt 120 
ccgtcttccc gccgctatcc ttatgttaac 180 
gttcttcgcg tggaaggctt aacaaaaaca 24 0 
agctttatta tgaatcgaga agataaaatt 300 
actacgctgt ttaaaatcat ttccggggaa 360 
ggtgttacca catctcaagc gtattttcca 420 
gatctgaacc ttgtagactg gcttcgccaa 480 
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tattctccgc acgaccaaag tgagagcttt 
tctggagaag aagtccacaa aaaagcaaat 
atgctgtcga aagcgatgct ttctggcgcc 
catttagacc tagagt'ccat tacagcgctc 
atgctgttta cttcccatga ccatcagttt 
attacaccta acggcatcgt cgataagcaa 
gatgtgcaga aaaaattgac tgaactatac 
ttttttgata cctaaatgtg aaaggagatc 
gtggagtagg cgggtatatt ggcggacggc 
tcgtgcgcca aaaacgagct gagcagctta 
aagggaatgt atcatttcag cccgaactaa 
tcgttatcat tgcttctaaa gcatactcgc 
ttatccatca agaatctgtc attatccctt 
tatttgcggc attttcaaaa gaacaggtgc 
tagacaacaa aggagaaatt catcatacga 
ggaacggcga gcgtacggag cggataagag 
ctgaagtcat cattagcggg catatcgaga 
atcaagataa gaaagaaaag gatttttcgc 
gaccacattt tttaatgtgg tctttattct 
gaaaattgga taaagtggga tatttttaaa 
tttaaaaaag gattgattct aatgaagaaa 
agataaaaat ttaggaggca tatcaaatga 
agagaaaaga gatatttaat cattatttga 
aaattgatat tagtgtttta taccgaaaca 
catttatttt cttagtgaca agggtgataa 
atagcgacgg agagttaggt tattgggata 
gtgtatctaa aacattctct ggtatttgga 
atgatttata cctttctgat gtagagaaat 
cacctatacc tgaaaatgct ttttctcttt 
ttaacttaaa tatcaataat aatagtaatt 
tcattaataa aggtaattca atatatttac 
gtgatggtta tcatgcagga ttgtttatga 
atgactggct tttataatat gagataatgc 
tgtcactaac ctgccccgtt agttgaagaa 
gggatcacta gtccaaacga cagaaggcga 
aaaacgttta tctctcgctg caccagtatt 
tgaagcagaa aaataaaaaa aggaggcgga 
aatttccgtt tctgaagata ggctctcttt 
tatccttaga accgatcata aagtccacgt 
caagctcttc acgagacatc tgctttccgc 
tcgccaaatg atttgacgcg ttttcatcaa 
gtgatatagg cgaatcgtaa ggaacaagtg 
ctgtttccac cagttctttt aaaatatcct 
ggccattttc aaacgtcagg gtgaaatttt 
ttgtgcttga taccactccg tcaaccccgt 
tcggcatatt ggccataaac tcatggccac 
catgcttcct aggcagctta attgttagat 
cgtaatgtct ctcgttcaaa tggtcaactt 
acgcctgaac cgggttgtct tcatttacgc 
ggtggatcgc ttcctcctct gatttgccag 
ccgcacctac gacagtccag ctgactttgt 
atgctttgcc tgctgctttt tggaatgccg 
agtctgggtt cgacgacaca acagaaatga 
tgccttttgc ttcccattca ggatattctt 
ttaatttggc gacttcgtca tcctgccaat 
atgcgtgttt tacaattaaa cggacaaaat 
catactggcc tttttggaca ttaacgc 
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ttacgcggtt tcttaggacg catgctgttc 540 
gtactttccg ggggagaaaa ggtccgctgt 600 
aatattttaa ttttggatga gccgaccaac 660 
aataacggct taatcagctt taaaggcgct 720 
gtgcagacca ttgccaacag aattatagaa 780 
atgagctatg acgagttcct tgaaaatgct 840 
gccgaataaa aaagcagaga tttctctgct 900 
acaacatgaa atttttggtt gtcggagcag 960 
tttcggagaa aggaaatgat gtgacatttc 1020 
aaaaaaccgg gcttgtcatc catagtgaaa 1080 
tcagtgccgg agaaacaggg caatttgatg 1140 
ttggtcaagt gatagaggat gtcaaaccat 1200 
ttttaaatgg gtaccgccac tatgagcagc 1260 
tgggcggcct gtgttttata gaaagtgctt 1320 
gcgcatcgca tcgttttgta tttggagaat 1380 
cgcttgaaga ggcattttca ggtgtgaagg 1440 
agatcccctg cagcaatagt tacccttatt 1500 
tacgctcaaa tcctttaaaa aaacacaaaa 1560 
tcaactaaag cacccattag ttcaacaaac 1620 
atatatattt atgttacagt aatattgact 1680 
gcagacaagt aagcctccta aattcacttt 1740 
actttaataa aattgattta gacaattgga 1800 
accaacaaac gacttttagt ataaccacag 1860 
taaaacaaga aggatataaa ttttaccctg 1920 
actcaaatac agcttttaga actggttaca 1980 
agttagagcc actttataca atttttgatg 2040 
ctcctgtaaa gaatgacttc aaagagtttt 2100 
ataatggttc ggggaaattg tttcccaaaa 2160 
ctattattcc atggacttca tttactgggt 2220 
accttctacc cattattaca gcaggaaaat 2280 • 
cgctatcttt acaggtacat cattctgttt 2340 . 
actctattca ggaattgtca gataggccta 2400 
cgactgtact ttttacagtc ggttttctaa 2460 
ggtttttata ttacagctcc cgggagatct 2520 
ccacctgcat ggatttttga ttgaaaaagc 2580 
agaaaccgtt tatgcgaatc tgcaaatgta 2640 
aaagcctcct tttatttact taaaaagccc 2700 
tcccgtctgc cgtaattccg tcaatattca 27 60 
gtgtaatgct ttcatttagg ccttctttga 2820 
cttcaatatt aaaggcatag gcgcttccga 2880 
acagcgtgtt atagaaaaga atgtttgatt 2940 
ccacttcacc taaatagtga gaaccttcat 3000 
cacctttttc agctttaatg tcgactatac 3060 
caataatatt tccgccgtag cttaatggtt 3120 
ctttttgcgg cagcgtgaac acttcttctg 3180 
tttcattcac gcttcccgca cctgcccaaa 3240 
cagttccttc tgcttgataa tgtaaggcag 3300 
tttcatgaag attttggtca tgattgatcc 3360 
gcgtcgcttt aaaaatttct tcccacagaa 3420 
gaaacacctt gtgagcccag cctgctgatg 3480 
ctgattgaat atattgtctg tatgtatgta 3540 
caatccgttt ggaatctata ccttttagca 3600 
aagcagctcc atttttggca agctcttctc 3660 
caaatgcttc aaacggcgca agttcgtatt 3720 
tcacggtgac gttttttgcg cccttttcat 3780 
cccgaacgtc tgttgaagca tttacgacta 3840 

3867 
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<210> 25 
<211> 8704 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: vector 

<220> 

<223> pAN238 
<400> 25 

gcggccgctt cgtcgaccga aacagcagtt ataaggcatg aagctgtccg gtttttgcaa 60 
aagtggctgt gactgtaaaa agaaatcgaa aaagaccgtt ttgtgtgaaa acggtctttt 120 
tgtttccttt taaccaactg ccataactcg aggcctacct agcttccaag aaagatatcc 180 
taacagcaca agagcggaaa gatgttttgt tctacatcca gaacaacctc tgctaaaatt 240 
cctgaaaaat tttgcaaaaa gttgttgact ttatctacaa ggtgtggtat aataatctta 300 
acaacagcag gacgctctag aggaggagac taacatgaaa tttttggttg tcggagcagg 360 
tggagtaggc gggtatattg gcggacggct ttcggagaaa ggaaatgatg tgacatttct 420 
cgtgcgccaa aaacgagctg agcagcttaa aaaaaccggg cttgtcatcc atagtgaaaa 4 80 
agggaatgta tcatttcagc ccgaactaat cagtgccgga gaaacagggc aatttgatgt 54 0 
cgttatcatt gcttctaaag catactcgct tggtcaagtg atagaggatg tcaaaccatt 600 
tatccatcaa gaatctgtca ttatcccttt tttaaatggg taccgccact atgagcagct 660 
atttgcggca ttttcaaaag aacaggtgct gggcggcctg tgttttatag aaagtgcttt 720 
agacaacaaa ggagaaattc atcatacgag cgcatcgcat cgttttgtat ttggagaatg 780 
gaacggcgag cgtacggagc ggataagagc gcttgaagag gcattttcag gtgtgaaggc 840 
tgaagtcatc attagcgggc atatcgagaa ggacatttgg aaaaagtatc tctttattgc 900 
agcgcaagcg gggatcacaa cgttatttca acgaccgctt ggcccaatcc tcgccacaga 960 
agccggacgt cacacggccc aaactcttat tggggaaatt tgcactgttt tacgaaaaga 1020 
aggtgttccg gctgatccgg ctcttgagga agagagcttt cgtacgatga ccagcatgtc 1080 
ttaccatatg aagtcctcca tgcttcggga tatggaaaac ggccaaacga cagaaggcga 1140 
ccacctgcat ggatttttga ttgaaaaagc aaaacgttta tctctcgctg caccagtatt 1200 
agaaaccgtt tatgcgaatc tgcaaatgta tgaagcagaa aaataaaaaa aggaggcgga 1260 
aaagcctcct tttatttact taaaaagccc aatttccgtt tctgaagata ggctctcttt 1320 
tcccgtctgc cgggatcctg ttttggcgga tgagagaaga ttttcagcct gatacagatt 1380 
aaatcagaac gcagaagcgg tctgataaaa cagaatttgc ctggcggcag tagcgcggtg 14 4 0 
gtcccacctg accccatgcc gaactcagaa 'gtgaaacgcc gtagcgccga tggtagtgtg 1500 
gggtctcccc atgcgagagt agggaactgc caggcatcaa ataaaacgaa aggctcagtc 1560 
gaaagactgg gcctttcgtt ttatctgttg tttgtcggtg aacgctctcc tgagtaggac 1620 
aaatccgccg ggagcggatt tgaacgttgc gaagcaacgg cccggagggt ggcgggcagg 1680 
acgcccgcca taaactgcca ggcatcaaat taagcagaag gccatcctga cggatggcct 1740 
ttttgcgttt ctacaaactc ttggtaccca gaaaaagcgg caaaagcggc tgttaaaaaa 1800 
gcgaaatcga. agaagctgtc tgccgctaag acggaatatc aaaagcgttc tgctgttgtg 1860 
tcatctttaa aagtcacagc cgatgaatcc cagcaagatg tcctaaaata cttgaacacc 1920 
cagaaagata aaggaaatgc agaccaaatt cattcttatt atgtggtgaa cgggattgct 1980 
gttcatgcct caaaagaggt tatggaaaaa gtggtgcagt ttcccgaagt ggaaaaggtg 2040 
cttcctaatg agaaacggca gctttttaag tcatcctccc catttaatat gaaaaaagca 2100 
cagaaagcta ttaaagcaac tgacggtgtg gaatggaatg tagaccaaat cgatgcccca 2160 
aaagcttggg cacttggata tgatggaact ggcacggttg ttgcgtccat tgataccggg 2220 
gtggaatgga atcatccggc attaaaagag aaatatcgcg gatataatcc ggaaaatcct 2280 
aatgagcctg aaaatgaaat gaactggtat gatgccgtag caggcgaggc aagcccttat 2340 
gatgatttgg ctcatggaac ccacgtgaca ggcacgatgg tgggctctga acctgatgga 2400 
acaaatcaaa tcggtgtagc acctggcgca aaatggattg ctgttaaagc gttctctgaa 24 60 
gatggcggca ctgatgctga cattttggaa gctggtgaat gggttttagc accaaaggac 2520 
gcggaaggaa atccccaccc ggaaatggct cctgatgttg tcaataactc atggggaggg 2580 
ggctctggac ttgatgaatg gtacagagac atggtcaatg cctggcgttc ggccgatatt 2640 
ttccctgagt tttcagcggg gaatacggat ctctttattc ccggcgggcc tggttctatc 2700 
gcaaatccgg caaactatcc agaatcgttt gcaactggag cgactgagaa ttccaattcc 27 60 
ccatggagag aaaagaaaat cgctaatgtt gattactttg aacttctgca tattcttgaa 2820 
tttaaaaagg ctgaaagagt aaaagattgt gctgaaatat tagagtataa acaaaatcgt 2880 
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gaaacaggcg aaagaaagtt gtatcgagtg tggttttgta aatccaggct ttgtccaatg 2940 

tgcaactgga ggagagcaat gaaacatggc attcagtcac aaaaggttgt tgctgaagtt 3000 

attaaacaaa agccaacagt tcgttggttg tttctcacat taacagttaa aaatgtttat 3060 

gatggcgaag aattaaataa gagtttgtca gatatggctc aaggatttcg ccgaatgatg 3120 

caatataaaa aaattaataa aaatcttgtt ggttttatgc gtgcaacgga agtgacaata 3180 

aataataaag ataattctta taatcagcac atgcatgtat tggtatgtgt ggaaccaact 324 0 

tattttaaga atacagaaaa ctacgtgaat caaaaacaat ggattcaatt ttggaaaaag 3300 

gcaatgaaat tagactatga tccaaatgta aaagttcaaa tgattcgacc gaaaaataaa 3360 

tataaatcgg atatacaatc ggcaattgac gaaactgcaa aatatcctgt aaaggatacg 3420 

gattttatga ccgatgatga agaaaagaat ttgaaacgtt tgtctgattt ggaggaaggt 34 80 

ttacaccgta aaaggttaat ctcctatggt ggtttgttaa aagaaataca taaaaaatta 3540 

aaccttgatg acacagaaga aggcgatttg attcatacag atgatgacga aaaagccgat 3600 

gaagatggat tttctattat tgcaatgtgg aattgggaac ggaaaaatta ttttattaaa 3660 

gagtagttca acaaacgggc catattgttg tataagtgat gaaatactga atttaaaact 3720 

tagtttatat gtggtaaaat gttttaatca agtttaggag gaattaatta tgaagtgtaa 3780 

tgaatgtaac agggttcaat taaaagaggg aagcgtatca ttaaccctat aaactacgtc 384 0 

tgccctcatt attggagggt gaaatgtgaa tacatcctat tcacaatcga atttacgaca 3900 

caaccaaatt ttaatttggc tttgcatttt atcttttttt agcgtattaa atgaaatggt 3960 

tttgaacgtc tcattacctg atattgcaaa tgattttaat aaaccacctg cgagtacaaa 4020 

ctgggtgaac acagccttta tgttaacctt ttccattgga acagctgtat atggaaagct 4080 

atctgatcaa ttaggcatca aaaggttact cctatttgga attataataa attgtttcgg 414 0 

gtcggtaatt gggtttgttg gccattcttt cttttcctta cttattatgg ctcgttttat 4200 

tcaaggggct ggtgcagctg catttccagc actcgtaatg gttgtagttg cgcgctatat 4260 

tccaaaggaa aataggggta aagcatttgg tcttattgga tcgatagtag ccatgggaga 4320 

aggagtcggt ccagcgattg gtggaatgat agcccattat attcattggt cctatcttct 4380 

actcattcct atgataacaa ttatcactgt tccgtttctt atgaaattat taaagaaaga 4440 

agtaaggata aaaggtcatt ttgatatcaa aggaattata ctaatgtctg taggcattgt 4500 

attttttatg ttgtttacaa catcatatag catttctttt cttatcgtta gcgtgctgtc 4560 

attcctgata tttgtaaaac atatcaggaa agtaacagat ccttttgttg atcccggatt 4 620 - 

agggaaaaat atacctttta tgattggagt tctttgtggg ggaattatat ttggaacagt 4680 • 

agcagggttt gtctctatgg ttccttatat gatgaaagat gttcaccagc taagtactgc 474 0 • 

cgaaatcgga agtgtaatta ttttccctgg aacaatgagt gtcattattt tcggctacat 4 800 

tggtgggata cttgttgata gaagaggtcc tttatacgtg ttaaacatcg gagttacatt 4 8 60 

tctttctgtt agctttttaa ctgcttcctt tcttttagaa acaacatcat ggttcatgac 4920 

aattataatc gtatttgttt taggtgggct ttcgttcacc aaaacagtta tatcaacaat 4980 •. 

tgtttcaagt agcttgaaac agcaggaagc tggtgctgga atgagtttgc ttaactttac 5040 

cagcttttta tcagagggaa caggtattgc aattgtaggt ggtttattat ccataccctt 5100 

acttgatcaa aggttgttac ctatggaagt tgatcagtca acttatctgt atagtaattt 5160 

gttattactt ttttcaggaa tcattgtcat tagttggctg gttaccttga atgtatataa 5220 

acattctcaa agggatttct aaatcgttaa gggatcaact ttgggagaga gttcaaaatt 5280 

gatccttttt ttataacagt tcgaagcggc cgcaattctt gaagacgaaa gggcctcgtg 5340 

atacgcctat ttttataggt taatgtcatg ataataatgg tttcttagac gtcaggtggc 54 00 

acttttcggg gaaatgtgcg cggaacccct atttgtttat ttttctaaat acattcaaat 54 60 

atgtatccgc tcatgagaca ataaccctga taaatgcttc aataatattg aaaaaggaag 5520 

agtatgagta ttcaacattt ccgtgtcgcc cttattccct tttttgcggc attttgcctt 5580 

cctgtttttg ctcacccaga aacgctggtg aaagtaaaag atgctgaaga tcagttgggt 5640 

gcacgagtgg gttacatcga actggatctc aacagcggta agatccttga gagttttcgc 5700 

cccgaagaac gttttccaat gatgagcact tttaaagttc tgctatgtgg cgcggtatta 57 60 

tcccgtattg acgccgggca agagcaactc ggtcgccgca tacactattc tcagaatgac 5820 

ttggttgagt actcaccagt cacagaaaag catcttacgg atggcatgac agtaagagaa 5880 

ttatgcagtg ctgccataac catgagtgat aacactgcgg ccaacttact tctgacaacg 594 0 

atcggaggac cgaaggagct aaccgctttt ttgcacaaca tgggggatca tgtaactcgc 6000 

cttgatcgtt gggaaccgga gctgaatgaa gccataccaa acgacgagcg tgacaccacg 6060 

atgcctgcag caatggcaac aacgttgcgc aaactattaa ctggcgaact acttactcta 6120 

gcttcccggc aacaattaat agactggatg gaggcggata aagttgcagg accacttctg 6180 

cgctcggccc ttccggctgg ctggtttatt gctgataaat ctggagccgg tgagcgtggg 6240 

tctcgcggta tcattgcagc actggggcca gatggtaagc cctcccgtat cgtagttatc 6300 

tacacgacgg ggagtcaggc aactatggat gaacgaaata gacagatcgc tgagataggt 6360 

gcctcactga ttaagcattg gtaactgtca gaccaagttt actcatatat actttagatt 6420 

gatttaaaac ttcattttta atttaaaagg atctaggtga agatcctttt tgataatctc 6480 



-17- 



wo 02/057476 



PCT/US02/01887 



atgaccaaaa tcccttaacg tgagttttcg 
atcaaaggat cttcttgaga tccttttttt 
aaaccaccgc taccagcggt ggtttgtttg 
aaggtaactg gcttcagcag agcgcagata 
ttaggccacc acttcaagaa ctctgtagca 
ttaccagtgg ctgctgccag tggcgataag 
tagttaccgg ataaggcgca gcggtcgggc 
ttggagcgaa cgacctacac cgaactgaga 
acgcttcccg aagggagaaa ggcggacagg 
gagcgcacga gggagcttcc agggggaaac 
cgccacctct gacttgagcg tcgatttttg 
aaaaacgcca gcaacgcggc ctttttacgg 
atgttctttc ctgcgttatc ccctgattct 
gctgataccg ctcgccgcag ccgaacgacc 
gaagagcgcc tgatgcggta ttttctcctt 
tggtgcactc tcagtacaat ctgctctgat 
tatcgctacg tgactgggtc atggctgcgc 
cctgacgggc ttgtctgctc ccggcatccg 
gctgcatgtg tcagaggttt tcaccgtcat 
gctcatcagc gtggtcgtga agcgattcac 
cgttgagttt ctccagaagc gttaatgtct 
cggttttttc ctgtttggtc acttgatgcc 
ggtaatgata ccgatgaaac gagagaggat 
tgcccggtta ctggaacgtt gtgagggtaa 
gagaaaaatc actcagggtc aatgccagcg 
gggtagccag cagcatcctg cgatgcagat 
ccgcgtttcc agactttacg aaacacggaa 
cgcagacgtt ttgcagcagc agtcgcttca 
ctaaccagta aggcaacccc gccagcctag 
gcgcacccgt ggccaggacc caacgctgcc 
ggcggacgcg atggatatgt tctgccaagg 
gaattgattg gctccaattc ttggagtggt 
attcaggtcg aggtggcccg gctccatgca 
tatagggcgg cgcctacaat ccatgccaac 
atcgccgtga cgatcagcgg tccagtgatc 
ccttgaagct gtccctgatg gtcgtcatct 
ggcatcccga tgccgccgga agcgagaaga 
gtcg 



ttccactgag cgtcagaccc cgtagaaaag 654 0 
ctgcgcgtaa tctgctgctt gcaaacaaaa 6600 
ccggatcaag agctaccaac tctttttccg 6660 
ccaaatactg tccttctagt gtagccgtag 6720 
ccgcctacat acctcgctct gctaatcctg 6780 
tcgtgtctta ccgggttgga ctcaagacga 6840 
tgaacggggg gttcgtgcac acagcccagc 6900 
tacctacagc gtgagctatg agaaagcgcc 6960 
tatccggtaa gcggcagggt cggaacagga 7020 
gcctggtatc tttatagtcc tgtcgggttt 7080 
tgatgctcgt caggggggcg gagcctatgg 7140 
ttcctggcct tttgctggcc ttttgctcac 7200 
gtggataacc gtattaccgc ctttgagtga 7260 
gagcgcagcg agtcagtgag cgaggaagcg 7320 
acgcatctgt gcggtatttc acaccgcata 7380 
gccgcatagt taagccagta tacactccgc 7440 
cccgacaccc gccaacaccc gctgacgcgc 7500 
cttacagaca agctgtgacc gtctccggga 7560 
caccgaaacg cgcgaggcag ctgcggtaaa 7620 
agatgtctgc ctgttcatcc gcgtccagct 7680 
ggcttctgat aaagcgggcc atgttaaggg 774 0 
tccgtgtaag ggggaatttc tgttcatggg 7800 
gctcacgata cgggttactg atgatgaaca 7860 
acaactggcg gtatggatgc ggcgggacca 7920 
cttcgttaat acagatgtag gtgttccaca 7 980 
ccggaacata atggtgcagg gcgctgactt 8040 
accgaagacc attcatgttg ttgctcaggt 8100 
cgttcgctcg cgtatcggtg attcattctg 8160 
ccgggtcctc aacgacagga gcacgatcat 8220 
cgagatgcgc cgcgtgcggc tgctggagat 8280 
gttggtttgc gcattcacag ttctccgcaa 8340 
gaatccgtta gcgaggtgcc gccggcttcc 84 00 
ccgcgacgca acgcggggag gcagacaagg 84 60 
ccgttccatg tgctcgccga ggcggcataa 8520 
gaagttaggc tggtaagagc cgcgagcgat 8580 
acctgcctgg acagcatggc ctgcaacgcg 8 64 0 
atcataatgg ggaaggccat ccagcctcgc 8700 

8704 



-18- 



